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Misguided cancer goal 


Aninfluential US advocacy group has set a deadline to beat breast cancer by 2020. But it puts 
public trust at risk by promising an objective that science cannot yet deliver. 


setting of goals, and the drive to reach them, is to be com- 

mended, and cancer is no exception. But a 2020 deadline for 
‘ending’ breast cancer that former US President Bill Clinton endorsed 
earlier this month is misguided. Like other ‘beat cancer’ deadlines 
that are regularly floated, it is potentially harmful to the public trust 
that underpins the whole research enterprise, not to mention to the 
patients who understandably cling to hope, whatever its validity. 

Clinton, who lost his mother to breast cancer, has become honorary 
chairman of a two-year-old campaign by the National Breast Cancer 
Coalition, which declares on its website that it has “One Mission: To End 
Breast Cancer by January 1, 2020” The advocacy and research-funding 
organization, based in Washington DC, adds that it has a “strategic plan” 
to achieve that mission, by focusing on prevention and on eliminating 
the metastatic form of the disease, which is what kills. 

The coalition provides a 4.5-page “blueprint” that is long on aspi- 
ration and short on scientific detail. For instance, it declares that by 
2020 “we must understand how to prevent people from getting breast 
cancer in the first place”. This goal leans heavily on the development of 
a preventive breast-cancer vaccine. A research plan for this is said to be 
“in place” and will serve as a model for other, “catalytic projects”. These 
could include exploiting the role of viruses and inflammation in breast 
cancer, and targeting the immune system to prevent metastasis. 

Ambitious goals are perfectly defensible, and indeed desirable, when 
we have the means to achieve them. The campaign to eradicate small- 
pox made eminent sense once a vaccine was ready, as does the goal of 
eliminating polio. Yet the thorny problems of finishing off even polio, 
for which we have had a vaccine for nearly 60 years, provide a caution- 
ary tale about the advisability of setting out to eliminate any disease. 

This is particularly true of the myriad diseases we collectively call 
cancer, the complexities of which we have scarcely begun to fathom. 
Consider just one study, published earlier this year (P. J. Stephens et al. 
Nature 486, 400-404; 2012), which analysed protein-coding genes in 
breast cancers from 100 different women and found no fewer than 
40 different mutational drivers of the disease. These were found in 
73 different combinations in the 100 patients, who each had between 
one and six mutations. The low-hanging fruit here is scarce: only 28 
of the patients harboured just one mutation, and finding a targeted 
therapy for even these single-mutation cases will be a daunting task. 

Added to that is the disease’s intractability. It cannot be banished like 
smallpox; our biologies are by definition vulnerable to a disease that 
has infinite manifestations profoundly rooted in our genetics. Even if 
a panoply of promising therapies were available, the eight to ten years 
it takes to complete a clinical trial makes a 2020 deadline impossible. 
As for prevention, truly valuable trials require not years but decades, 
because of the various influences on breast-cancer development during 
alifetime. Britain’s Breakthrough Generations Study, which recruited its 
100,000th participant in 2009, anticipates running for 40 years. 


| | ope is not a good strategy, in life or in disease research. So the 


The National Breast Cancer Coalition counters that such arguments 
cater to those content with the status quo — what the coalition sees as 
the drift of a research enterprise that, after decades of investment, is not 
motivated by sufficient urgency. On the contrary: we are all for urgency, 
but in the service of goals that are within the realms of possibility. 

Here are a few. Set out to identify all tumours in which the 
HER2 gene is mutated and treat them with the drug Herceptin 
(trastuzumab) by 2020. The treatment is known to work for this 
genetic category of the disease, so this is 
not inconceivable. Or declare that in five 
years, we will have developed several robust 
breast-cancer models that could rapidly 
be deployed to evaluate the functional sig- 
nificance of the mutations and polymorphisms that genomics is 
uncovering at a breathtaking rate. A project such as this, with finite 
parameters and price tag, can be pegged to an achievable time frame. 

Or, tackle another cancer afflicting women by campaigning to over- 
come the apathy with which the human papillomavirus vaccine has 
been greeted in the United States. Universal vaccination of 11- and 
12-year-old girls against the cervical-cancer-causing virus would, at a 
stroke, provide huge gains against the roughly 4,000 deaths and 12,000 
new cases of this cancer that are seen in the United States each year. 

Discovery does not answer to deadlines, and campaigns that pre- 
tend that it does risk wasting public trust, whether from the taxpayers 
who support the US National Institutes of Health or from the millions 
of donors who give to dozens of disease-advocacy groups. There is a 
fine line between creating a sense of urgency and promising too much; 
it is best to stay on the side of the line that is realistic about how science 
works, and about what is currently achievable. m 


“Discovery does 
not answer to 
deadlines.” 


A way to buy time 


With climate talks inching along, gains in energy 
efficiency could slow the rise in emissions. 


once again to discuss global warming. With commitments 

under the Kyoto Protocol ending this year (see page 653), one 

key goal of the United Nations meeting in Doha is to make progress 

towards the 2015 signing of a new global climate treaty, to take effect 

by 2020. The world is on track for a temperature increase of up to 4°C 
by the end of the century, but the UN hopes to limit that to just 2°C. 

Unfortunately, diplomacy and global warming operate on incom- 

patible schedules. An eight-year wait for action would seem to put the 


ik week and next, diplomats from around the world gather 
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warming goal firmly out of reach. But there are ways to buy time for 
global diplomacy, and energy efficiency is at the top of the list. 

The World Energy Outlook 2012 report from the International 
Energy Agency (IEA) suggests that the global infrastructure could lock 
in enough carbon emissions by 2017 to exceed the 2 °C goal, unless 
facilities such as power plants, factories and buildings are expensively 
retrofitted or prematurely retired. But the IEA found that improving 
energy efficiency could give the world another five years to change 
course and begin the transition to renewables and other low-carbon 
energies. 

Globally, energy use is projected to increase by more than one-third 
by 2035, despite promises by Japan, Europe, China and the United 
States to curb demand. In an ‘efficient world’ scenario, with more 
countries embracing bigger efficiency goals, the projected energy 
demand could be cut by half. For perspective, the IEA estimates that 
the modest efficiency increases achieved between 1980 and 2010 
reduced global energy demand by 35% — roughly equivalent to the 
energy currently consumed by China and the United States combined. 

The IEA suggests that more-aggressive efficiency measures, such as 
a broad shift toward efficient appliances, vehicles, homes and facto- 
ries, would cost an extra US$11.8 trillion between now and 2035. But 
the pay-off would be substantial: direct fuel expenditures would fall 
by $17.5 trillion, and investments in energy infrastructure by nearly 
$5.9 trillion. Those savings would be reinvested elsewhere, helping to 
increase global economic output by some $18 trillion. Unfortunately, 
the potential gains are dispersed throughout a complex marketplace 
that tends to reward short-term thinking. 


Governments must pursue solutions at all levels, and not wait until 
the next global treaty. Reducing subsidies on fossil fuels would cut 
energy consumption, for instance, as would increasing consumption 
taxes. High energy taxes help to explain why Japan and Europe are 
leaders in energy efficiency, just as increasing oil prices on the global 
market have encouraged Americans to reduce their oil consumption. 

But playing with the price won't work if the signals aren't reaching 
the right people. Buildings are responsible 


“Improving for roughly one-third of global greenhouse- 
ener; 8y gas emissions, but builders have no incentive 
efficiency could to invest in energy-efficient technologies if 
give the world tenants and owners will foot the energy bill. 
another five To change that, governments can strengthen 
yearstochange _ building codes for new construction and 
course.” create financial incentives that reduce the 


up-front costs of retrofitting. They can also 
require energy audits when properties are sold; this encourages buyers 
and sellers alike to consider long-term operating costs. 

In Doha and beyond, negotiators must look for opportunities for the 
world to embrace new and more ambitious climate goals. At the same 
time, governments must do everything they can to follow through with 
their own climate commitments, reduce carbon footprints at home 
and lay the groundwork for future steps. Stabilizing the climate will 
require monumental efforts on all fronts, and governments should 
recognize that money spent now on curbing greenhouse-gas emissions 
is a long-term investment that will pay off down the road. Nowhere is 
this clearer than in the arena of energy efficiency. m 


A bleak Horizon 


Researchers should lobby against heavy 
cuts to pan-European research funds. 


walked away from talks last week without a deal on the Euro- 
pean budget for the rest of the decade. The breakdown casts 
into limbo a European Commission proposal to apportion around 
€80 billion (US$104 billion) to research over the period 2014-20 —a 
€29.5 billion rise on Europe's current seventh Framework programme. 
And it augurs trouble for research when the impasse is finally broken. 

With 27 nations each pushing for their own priorities, finding an 
agreement on spending plans is inevitably complex, and the tight eco- 
nomic climate aggravated the differences even more than usual. 

The key divisive factor is the demand from wealthy nations, 
including the United Kingdom, Germany and the Netherlands, for 
substantial cuts to the total €1.025-trillion European Union (EU) 
budget — a rise of around €50 billion on spending between 2007 and 
2013 — proposed by the European Commission. Early in the talks, 
European Council president Herman Van Rompuy, who is chairing 
the negotiations, proposed a cut of €80 billion. Media reports say that 
rich nations are looking for further cuts, of between €30 billion and 
€75 billion. Speaking to reporters after the talks broke down on Friday 
afternoon, Van Rompuy said that member states had found a “suf- 
ficient degree of potential convergence” to make an agreement on the 
budget possible early next year. 

This should leave enough time for the European Parliament, 
member states and the commission to thrash out the final details of the 
research programme, known as Horizon 2020, just in time for research 
projects to start in 2014, as planned. But that is one of the few bright 
spots in the outlook for research. 

Of the cuts suggested by Van Rompuy, the Horizon 2020 research 
programme comes out among the worst, with a proposed 12% 


A fter much posturing and politicking, European leaders 
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reduction in funding, according to calculations by the Initiative for 
Science in Europe (ISE), an independent advocacy coalition of learned 
societies and scientific organizations in Heidelberg, Germany. The Gali- 
leo satellite network, set to rival the US Global Positioning system, faces 
a 10% cut, and the budget for ITER, the world’s largest nuclear-fusion 
experiment, is also under threat. Van Rompuy says member states agree 
that the final budget should encourage economic growth, by focusing 
spending on research and innovation, as well as on jobs. But EU politics 
force other priorities. The sharp cuts for research in the Van Rompuy 
plans allow for more moderate reductions of 3.7% in the budget for 
agriculture to appease France, and of 5.6% to ‘cohesion funds’ meant 
for poorer EU regions, to bring Poland on board with the negotiations. 
If the proposed 12% cut to research funding sticks in the final deal, 
all aspects of the Horizon 2020 programme are likely to suffer equally. 
Unforgivably, this would include the programme's ‘Excellent Science’ 
initiatives, such as the European Research Council (ERC), which funds 
investigator-led frontier research, as well as research infrastructures, 
such as CERN — the world’s largest particle-physics laboratory, near 
Geneva in Switzerland, and the institution responsible for the recent 
discovery of the Higgs boson. The valuable Marie Curie fellowships 
through which young researchers gain support for career development 
and experience working in labs abroad would also be threatened. 
Helga Nowotny, president of the ERC, sees a bleak future for the 
council under the Van Rompuy proposals. She fears that the suggested 
cuts could result in funding for grants in 2014 dropping below levels 
available in 2009-10. Reductions of this magnitude will decimate suc- 
cess rates, particularly for young researchers, for whom other funding 
sources are scarce, she says. This would seriously damage the reputation 
painstakingly built by the ERC since it was founded just five years ago. 
European researchers should do everything in their power to 
articulate the case for Europe’s developing excellence, on which its 
future supply of scientific and technical manpower will depend. They 
should lobby their national leaders and support 
the efforts of the ISE. They can start by sign- 
ing the petition, which had, as Nature went to 
press, collected almost 149,000 signatures, at: 
go.nature.com/s2nm 1w. m 
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WORLD VIEW .cnsicor son 


latest in a long line of messages from US biologists’ main lobby 

group, the US Federation of American Societies for Experi- 
mental Biology (FASEB), about the “devastating” implications if the 
country careers off the ‘fiscal cliff’ in January. 

The fiscal cliffis a set of sharp budget cuts (called sequestration) and 
tax increases that will take effect in January if Congress and the White 
House fail to agree before then on other ways to balance the budget. 

Now, what I want to knowis why science lobbyists in Washington DC 
have spent all summer panicking publicly about a budget plan that many 
of the people they represent would consider the least-bad outcome — for 
both US society and US science — of those on the menu. 

The United States, most observers agree, faces an outlandish deficit. 
This year, US$3.6 trillion (24% of gross domestic product) will be 
spent by the government, but only $2.2 billion 
will be raised in taxes. Scientists know as well as 
anyone that this is unsustainable. 

Wrangling over how to tame the deficit ended 
a year ago, when a congressional ‘super-com- 
mittee’ failed to reach agreement. That left what 
is now known as the fiscal cliff — a fall-back 
arrangement agreed in August 2011 to force a 
better deal. It mandates that unless alternative 
plans are agreed, taxes will rise and across-the- 
board spending cuts will take effect. 

For those of a progressive bent, the fiscal cliff 
has many attractions. First, it spreads cuts evenly 
across all ‘discretionary’ spending — including 
the half that goes to the Pentagon. Second, it pro- 
tects Social Security, Medicare and Medicaid — the linchpins of the 
United States’ threadbare welfare state — from any cuts whatsoever. 
In so doing, it refuses to balance the budget on the backs of the poor. 

Finally, and most importantly, it closes 80% of the deficit through 
higher taxation, and only 20% through spending cuts. That's a sensible 
approach in a country where income tax rates — on the middle class 
as well as on the rich — have grown unfeasibly low. 

The fiscal cliff, then, is a tough budget package that leans firmly to 
the left. How did a right-leaning Congress get there? Well, lawmakers 
never thought it would get enacted. Now they are trying to unravel 
it. And every special interest in Washington DC, from FASEB to the 
National Association of Manufacturers, is keen to lend a hand. 

The scientific societies have been warning all summer that sequestra- 
tion would be a disaster for science, imposing cuts of up to 8% in the 
budgets for 2013. Under this scenario, the National Institutes of Health 
would, if past is prelude, reduce its average annual 
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— Ww Science should be ready 
= to jump off ‘the cliff 


Researchers can find plenty to like in a US budget scenario that scientific 
societies are comparing to the apocalypse, says Colin Macilwain. 


8% drop in grant funding. The nation might, for example, continue to 
slip and fudge into inexorable debt and decline — a bad thing for scien- 
tists as well as the public. Going over the cliff would avert that: even the 
new apparatchiks on the Central Committee of the Chinese Communist 
Party might wake up on 1 January, blink and think: good God, perhaps 
America isn't finished, after all. 

As FASEB and other science supporters know, research and devel- 
opment spending has not and will never veer far from its historical 
level of one-seventh of US discretionary spending. If taxes were raised 
and defence spending cut, the long-term outlook for non-defence 
discretionary spending would brighten considerably. 

After all, the ‘cliff’ isn’t a cliff at all. It is simply a new baseline, with 
proper taxes paid, spending reduced and the poor protected. Once it 
is set, the path may be open for selective spending boosts — including, 
perhaps, in research — as well as tax reductions. 
That’s why people such as Peter Orszag, a mem- 
ber of the Institute of Medicine and President 
Barack Obama’ first budget director, and anti- 
austerity economist Paul Krugman say that going 
over the cliff may be the best path to a reasonable 
budget settlement. 

Concern that the cold bath of spending cuts 
and tax rises will send the economy into reces- 
sion is legitimate. But economists do not actu- 
ally know how fiscal tightening affects economic 
growth. Just last month, the International Mon- 
etary Fund revised its estimate of the fiscal multi- 
plier — the dollars of economic activity generated 
by each dollar of government spending — from 
0.5 to “in the range of 0.9 to 1.7”, admitting, really, that it can't read the 
complex relationship between fiscal tightening and economic growth. 

Sure, Obama and his lieutenants need to say publicly that the nation 
must avoid going over the cliff. In conducting negotiations for a deficit 
reduction that does not savage public spending, however, his willing- 
ness to take the drop is his single most powerful weapon. 

Democratic politicians such as Obama increasingly see scientists 
as part of their constituency (in the final tracking poll by the Wash- 
ington Post and the ABC, Obama beat Romney 60% to 38% among 
voters with a postgraduate degree). So the bleating about the cliff from 
scientific societies merely serves to lessen that resolve. The science 
lobby, in other words, is pushing the president to fold. 

But the president shouldnt give an inch, and, if need be, he should 
be ready to jump off that cliff. As he smiles for the television cameras 
and joshes with House Speaker John Boehner, I hope that Obama 
will quote an old enemy to his new friend. “Go ahead,’ should be his 
whispered message, “make my day.” = 


Colin Macilwain writes about science policy from Edinburgh, UK. 
e-mail: cfmworldview@gmail.com 
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RESEARCH HIGHLIGHTS 


Selections from the 
scientific literature 


BOTANY 


Plant fertilization 
protein found 


Fertilization in flowering plants 
is dependent on a protein that 
is secreted by the egg cell and 
activates incoming sperm. 

Stefanie Sprunck at the 
University of Regensburg in 
Germany and her colleagues 
show that, in the model plant 
Arabidopsis thaliana, the 
arrival of sperm cells near 
the egg causes the release 
ofa protein they call EGG 
CELL 1 (EC1). This triggers 
the redistribution of a second 
protein — one linked to fusion 
of the sex cells, or gametes — 
from inside the sperm to the 
sperm cell surface. 

Sperm cells interacting with 
mutant Arabidopsis eggs that 
have faulty ecl genes failed 
to fuse, and the plant’s pollen 
tubes continued to deliver 
sperm into the embryo sac. 
These results suggest that EC1 
controls gamete fusion. 
Science 338, 1093-1097 (2012) 


CLIMATE CHANGE 


Carbon drop in 
snail shell shock 


Free-swimming snails 
show shell damage in water 
conditions that are likely to 
become more prevalent as the 
climate warms. By 2050, the 
top 200 metres of the Southern 
Ocean are likely to become 
under-saturated in a form 
of calcium carbonate called 
aragonite — a component 
of many shells. If aragonite 
structures are placed in waters 
in which the saturation state 
is less than one, they 
gradually dissolve. 
Geraint Tarling 
of the British 
Antarctic Survey 
in Cambridge, 
UK, and his team 
analysed the shells 


PARTICLE PHYSICS 


Time's arrow in B mesons 


A cornerstone of theoretical particle physics — 
the idea that not all processes run in the same 
way forwards in time as they do backwards — 
has been observed directly for the first time. 

Members of the BaBar Collaboration 
trawled data from their experiment (pictured), 
which ran at the SLAC National Accelerator 
Laboratory in Menlo Park, California, from 
1999 to 2008. The researchers identified 


B-meson decay chains that were time reversals 
of each other, and a comparison of the decay 
rates revealed a strong asymmetry. Earlier 
experiments have caught hints of time-reversal 
violation but failed to distinguish it clearly from 
violations of other fundamental symmetries. 
Phys. Rev. Lett. 109, 211801 (2012) 

For a longer story on this research, 

see go.nature.com/258vei 


of Limacina helicina antarctica 
pteropods (pictured) 
captured from the top 
200 metres in an upwelling 
region of the Southern Ocean 
in 2008. Their shells were 
thinner and more porous than 
those captured elsewhere. 
In the laboratory, eight days 
of immersion in waters with 
a saturation state of 
between 0.94 and 1.12 
produced similar 
levels of damage. 
Aragonite- 
shelled animals, 
important to food 
and carbon cycles, 
may decline sooner 


640 | NATURE | VOL 491 | 29 NOVEMBER 2012 
© 2012 Macmillan Publishers Limited. All rights reserved 


than expected in the Southern 
Ocean, the authors say. 

Nature Geosci. http://dx.doi.org/ 
10.1038/nge01635 (2012) 


ZOOLOGY 


Blue whales 
roll with it 


Blue whales (Balaenoptera 
musculus) can perform 
360° rolls, an impressive 
manoeuvre for the largest 
animal ever to have lived. 
Jeremy Goldbogen of the 
Cascadia Research Collective 
in Olympia, Washington, 
and his colleagues tagged 


blue whales off the coast 

of California with sensors 
that provided information 
regarding the animals’ speed, 
depth and body orientation. 
Of 22 whales tagged for an 
average of 6.7 hours, 11 were 
recorded performing rolls. In 
total, 44 rolls were observed, 
all during foraging. 

The authors suggest that 
these rolls serve a dual purpose, 
allowing the animal both to 
re-orient its body to capture the 
maximum amount of krill prey, 
and to better visualize the prey 
and its surroundings. 

Biol. Lett. http://dx.doi.org/ 
10.1098/rsbl.2012.0986 (2012) 


SLAC NATL ACCELERATOR LAB. 


CARL BUELL/J. VINTHER/BRISTOL UNIV. 


CHEMISTRY 


Instant steam 
from sunlight 


Nanoparticles that efficiently 
absorb light energy and convert 
it into heat can act as miniature 
steam generators in a liquid. 
Naomi Halas, Peter 
Nordlander and their 
colleagues at Rice University 
in Houston, Texas, used lenses 
to focus sunlight on carbon 
or gold-silica nanoparticles 
suspended in water. Within 
a few seconds, steam at a 
temperature well above 100°C 
was generated around the 
particle surfaces and bubbled 
away, whereas the bulk of the 
water heated up only gradually. 
This method of using 
sunlight to generate high- 
temperature steam could 
be used to sterilize waste or 
surgical instruments without 
the need to boil a large 
volume of fluid, the authors 
suggest. The same effect may 
improve distillation: sunlight 
focused on nanoparticles in 
an ethanol—water mixture 
produced vapours richer in 
ethanol than conventional 
thermal distillation. 
ACS Nano hitp://dx.doi. 
org/10.1021/nn304948h (2012) 


PALAEONTOLOGY 


Fossil hints at 
star’s salty past 


Modern echinoderms 
—invertebrates such as 
brittlestars and sea urchins 
— live only in open seas, but 
fossils from Europe suggest 
that this has not always been 
the case. 

A team led by Mariusz 
Salamon of the University 
of Silesia, Poland, examined 
fossils of the Aspiduriella 
similis brittlestar from a quarry 
in southern Poland. The fossils 
were embedded in limestone 
dated to the Middle Triassic 
period, more than 240 million 
years ago. Minerals and 
geological structures within 
the rocks suggest that the 
fossils formed in conditions 
with very high salt levels, such 
as those present in hypersaline 


coastal waters. Few other 
fossils were found in the rock, 
also pointing to harsh living 
conditions. Echinoderm fossils 
have been used as indicators 
of open marine environments, 
something the authors caution 
against. 

PLoS ONE 7, e49798 (2012) 


Switching off 
cancer resistance 


Modern therapies can target 
specific pathways in cancer 
cells, but the cells often become 
drug resistant. René Bernards 
of the Netherlands Cancer 
Institute in Amsterdam and 

his colleagues have identified a 
gene involved in resistance, and 
have found a way to stop it in 
its tracks. 

Resistance can be caused by 
mutations in genes or proteins 
that are not directly targeted by 
a drug. Bernards’ team used a 
genetic technique called RNA 
interference to investigate 
the effects of shutting down 
thousands of human genes. 

The researchers found 
that when the gene MED12 
was switched off, the cells in 
avariety of cancers became 
resistant to a range of anti- 
cancer drugs. Suppressing 
this gene activates the 
transforming growth 
factor § receptor (TGF-BR) 
and, conversely, inhibiting 
the signalling through this 
receptor in drug-resistant cells 
eliminates the resistance. 

Cell 151, 937-950 (2012) 


OPTICS 


Technology for 
thinner probes 


A single optical fibre could 
form the basis of thinner 
endoscopes — long imaging 
probes with medical, military 
and industrial uses. 

Current endoscopes are 
made up of millimetre-sized 
bundles of up to 100,000 fibres. 
Each fibre transports a single 
mode of light wave coming 
from the object being imaged, 
because the mixing of modes 
can cause light-wave distortion. 


RESEARCH HIGHLIGHTS 


THIS WEEK 


COMMUNITY 


CHOICE 


Cancer-drug infertility reversed 


3 HIGHLY READ 
on www.cell.com 
in November 


Transplanted testicular stem cells can 
restore fertility to macaque monkeys made 
infertile by chemotherapy. 


Kyle Orwig at the University of 

Pittsburgh in Pennsylvania and his colleagues took stem cells 
capable of developing into sperm from macaques and marked 
them with a lentivirus. After inducing infertility in the 
12 adult and 5 prepubescent donors, the researchers returned 
the marked cells to the animals. Marked genetic material later 
appeared in the sperm of nine of the adults, and three of the 
juveniles when they reached maturity. 

The team then ran a similar experiment, transplanting cells 
from the donors into other macaques. Of six adult recipients, 


two produced sperm from transplanted donor stem 
cells. Moreover, donor-derived sperm from one 
recipient successfully fertilized eggs to produce 


embryos with a donor father. 


Cell Stem Cell 11, 715-726 (2012) 


Wonshik Choi at Korea 
University in Seoul and his 
colleagues have used a single 
200-micrometre-wide fibre to 
transport multiple modes, by 
measuring and reverse 
engineering the 
distortion that each 
mode suffers. The authors 
used their technique to 
make a three-dimensional 
map of a sample of rat 
intestine. 

Phys. Rev. Lett. 109, 203901 
(2012) 


Birds ofa 
different feather 


The wings of ancient birds 
and feathered dinosaurs that 
lived some 150 million years 
ago may have been less like 
those of modern birds than 
previously thought. 
Contemporary birds share 
acommon wing design, with 
a single feather layer. But 
Nicholas Longrich of Yale 
University in New Haven, 
Connecticut, and his colleagues 
identify separate layers in their 
fossil analyses of the wings of 
the Jurassic bird Archaeopteryx 
lithographica (pictured) and 


the Cretaceous feathered 
dinosaur Anchiornis huxleyi. 

The slender feather shafts 
found in these prehistoric 
plumages would make the 
feathers weak by modern 
standards but, when stacked, 
may have formed a structure 
strong enough to generate lift. 
However, the layers would have 
limited the airflow through 
the wing, which is used by 
modern birds for take-off 
and low-speed flight, so these 
prehistoric flyers probably 
glided or parachuted down 
from trees, the authors say. 
Curr. Biol. http://dx.doi. 
org/10.1016/j.cub.2012.09.052 
(2012) 
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SEVEN DAYS 


EU budget limbo 


Talks between European 
heads of state ended on 

23 November without any 
agreement being reached 

on the European Union 

(EU) budget for 2014-20. 
Negotiations will continue 
next year, but the EU Horizon 
2020 research programme is 
facing cuts of 12% ina deal 
proposed by Herman Van 
Rompuy, president of the 
European Council, who is 
chairing the negotiations. See 
page 638 for more. 


Space deal 


Europe’ research ministers 
have agreed a budget of 

€10.1 billion (US$13 billion) 
for the European Space 
Agency (ESA) for the next 
few years, and have set out the 
next steps for replacing the 
Ariane 5 satellite-launching 
rocket. The settlement means 
that ESA’s science programme 
will have flat funding of about 
€500 million per year between 
2013 and 2017. See page 645 
and go.nature.com/dinqbk 
for more. 


UK energy deal 


Britain’s energy and finance 
ministries have strucka 

deal to support financing 

for low-carbon energy. On 

23 November, the government 
said that utility firms could 
triple customer charges 

that support nuclear, wind, 
solar and other low-carbon 
electricity sources, bringing 
such funding to £7.6 billion 
(US$12.2 billion) annually 
(in real terms) by 2020. The 
United Kingdom wants 30% 
of its electricity to come 

from renewables by then, 

up from 11% today. But the 
deal — which foreshadowed 
an energy bill released this 
week — saw politicians drop a 
proposal to eliminate carbon 
emissions from the electricity 


The news in brief 


Grand Canyon flooded to restore habitats 


A gush of water into the Colorado River marked 
the start of efforts to rebuild beaches and 
sandbars by redistributing sediment along the 
Grand Canyon. Water released from the Glen 
Canyon Dam flowed fastest between 9 p.m. on 
19 November and 10 p.m. the next day, with 


sector almost entirely by 2030. 
See go.nature.com/ly4mhe 
for more. 


Progress on HIV 


In the past two years, the 
number of people accessing 
antiretroviral therapy for 
HIV has increased by 63% 
globally. Moreover, in the past 
six years AIDS-related deaths 
have fallen by one-quarter (to 
around 1.7 million in 2011). 
And in the past decade the 
number of new infections 

has fallen by 20% (to around 
2.5 million in 2011) — with 
declines of more than 50% in 
25 low- and middle-income 
countries, many of them in 
Africa. The encouraging 
statistics were reported on 

20 November by the Joint 
United Nations Programme on 
HIV/AIDS. 
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Polio setback 

The Global Polio Eradication 
Initiative will miss its goal of 
stopping spread of the viral 
disease this year, says a report 
released this week by the 
programme’ Independent 
Monitoring Board. Of the 
four countries where the 
disease still exists, three — 
Afghanistan, Pakistan and 
Chad — are making progress 
against polio. However, the 
fourth, Nigeria, has seen cases 
double between 2011 and 
2012, accounting for more 
than half of the 175 cases 
recorded globally this year. 


UK science advice 
The UK Parliamentary Office 
of Science and Technology has 
been spared heavy budget cuts. 
The office, which provides 
politicians with analysis 


flows reaching around 1,200 cubic metres per 
second, according to the US Department of the 
Interior. The project, which follows 16 years 

of research on the potential effects of the 
releases, aims to restore vegetation and boost 
populations of fish and other wildlife. 


of scientific issues, was 

facing cuts of up to £98,000 
(US$157,000), or 17% of its 
budget. But after a protest 
backed by two former British 
science ministers, the office 
has secured financing until 
April 2015. See go.nature.com/ 
b3gph9 for more. 


| _BUSINESS 
Egg-free flu jab 


A seasonal influenza vaccine 
made without growing the 
virus in fertilized chicken 
eggs was approved for the 
first time by the US Food 
and Drug Administration on 
20 November. Flucelvax uses 
flu strains grown in animal 
cells and is made by drug 
giant Novartis, headquartered 
in Basel, Switzerland. Like 
other flu vaccines, it protects 


B. STRONG/REUTERS 


against the three strains of 
flu predicted to be the most 
prevalent during the coming 
flu season. See go.nature. 
com/1nmiqc for more. 


PEOPLE 
< Ape expert cleared 


z Sue Savage-Rumbaugh, 
former executive director of 
the Iowa Primate Learning 
Sanctuary in Des Moines, is 
to return to the centre after 
an investigation committee 
found no evidence to support 
allegations that she had failed 
to care for bonobos there. 

Ina statement released on 

20 November, the sanctuary 
said that the committee had 
“encountered significant 
counter-evidence against the 
claims”. Savage-Rumbaugh 
had been placed on leave in 
September pending the results 
of the enquiry. See go.nature. 
com/caqgik for more. 


Fraudster punished 


Nutrition researcher 

Eric Smart has agreed to 
refrain from making grant 
applications to the National 
Institutes of Health for seven 
years. The US Office of 
Research Integrity reported 
on 20 November that Smart 
fabricated figures in 10 papers 
and 7 grant applications over 
10 years while he was at the 
University of Kentucky in 
Lexington. See go.nature.com/ 
h4srje for more. 


CKE/TIME LIFE PICTURES/GETTY 


SOURCE: UNEP 


(Gt) of carbon dioxide and 


equivalent gases — but even the 
strictest pledges fall 8 Gt short of 
that target, a gap that has grown 


since last year. 


TREND WATCH 


As climate negotiators gather 

in Doha, the United Nations 
Environment Programme has 
released a report warning that 
pledges to cut greenhouse-gas 
emissions fall short of what is 
needed to have a “likely” (greater 
than 66%) chance of limiting 
global temperature rise to 2°C. 
By 2020, annual emissions must 
be no more than 44 gigatonnes 


Transplant pioneer 
Joseph Murray (pictured), who 
performed the first successful 
human organ transplant, died 
on 26 November, aged 93. 

In 1954, he transplanted a 
kidney between identical 
twins. Murray, a director of the 
Surgical Research Laboratory 
at Harvard Medical School 

in Boston, Massachusetts, 
shared the 1990 Nobel Prize in 
Physiology or Medicine. 


illegal profits 

Sidney Gilman, a neurologist at 
the University of Michigan in 
Ann Arbor, has been charged 
with insider trading that netted 
two hedge-fund companies 

a total of US$276 million. 
Gilman gave hedge-fund 
manager Mathew Martoma of 
CR Intrinsic Investors, based in 
Stamford, Connecticut, early 
news of safety data from clinical 
trials of an experimental drug 
for Alzheimer’s disease. Gilman 
has agreed to pay a settlement 
of more than $234,000, but 


EMISSIONS GAP GROWS 


Martoma faces criminal 
charges. See go.nature.com/ 
n&8tnbi for more. 


EU health chief 


Conservative Maltese politician 
Tonio Borg was approved as 
Europe’s commissioner for 
health and consumer affairs, 
after the European Parliament 
voted in favour of his 
appointment on 21 November. 
Critics fear that the devout 
Catholic could attempt to derail 
European Union funding for 
human embryonic stem-cell 
research. Borg replaces John 
Dalli, a Maltese politician who 
resigned on 16 October asa 
result of corruption charges. 


| _ERESEARCH 
Emissions peak 


The concentration of carbon 
dioxide in the atmosphere hit 
arecord high of 390.9 parts 
per million in 2011, the World 
Meteorological Organization 
reported on 20 November. 
Levels of the potent 
greenhouse gases methane and 
nitrous oxide also reached new 
highs last year. See go.nature. 
com/8hziue for more. 


Contagion concerns 


Four more cases of an infection 
caused by a novel coronavirus 
— the viral family behind 
severe acute respiratory 
syndrome (SARS) — were 
reported by the World Health 
Organization on 23 November, 


The chasm is widening between cuts in greenhouse-gas emissions 
needed by 2020, and those projected in each of the past three years 
on the basis of existing carbon-saving policies. 


2005 


emissions 


2010 


emissions 


45 
GtCO,e 


49 
Gt CO, e 


Annual emissions-gap projections 


2020 target to limit 2 °C warming 44 Gt CO,e 
e 58 Gt CO,e 


susineee! 
as usual 


Gt CO, e = Gigatonnes of carbon dioxide equivalent 


SEVEN DAYS | THIS WEEK | 


3-7 DECEMBER 
Preliminary findings 
from James Cameron's 
dive to the bottom of the 
Mariana Trench, and 
new results from NASA‘’s 
Curiosity rover on Mars, 
are discussed at the 
American Geophysical 
Union's meeting in San 
Francisco, California. 
fallmeeting.agu.org/2012 


5 DECEMBER 

UK scientists begin to 
drill into Antarctica’s 
subglacial Lake 
Ellsworth, buried under 
more than 3 kilometres 
of ice (see Nature 491, 
506-507; 2012). 
www.ellsworth.org.uk 


bringing the tally to six. Three 
cases, including one fatality, 
occurred in Saudi Arabia; 

the fourth was in Qatar. Two 
of the new cases are from 

one household, raising the 
possibility that the virus 

can be transmitted between 
people, not just by contact with 
infected animals. See go.nature. 
com/9f5zwq for more. 


Flight ban flouted 

A beagle breeder has dodged 
an airline’s ban on transporting 
animals bound for research 
labs by stating that the dogs 
would not be harmed. The 
puppies were bound for 
Advinus, a contract-research 
organization in Bangalore, 
India, that uses beagles in 
drug-toxicity work in which 
the animals are euthanized. But 
the breeder, Beijing Marshall 
Biotechnology, a branch of 
Marshall BioResources of 
North Rose, New York, told 
Cathay Pacific airline that the 
dogs would not be hurt or 
killed. The New York firm said 
last week that it is investigating 
the incident. See go.nature. 
com/zurgnl for more. 


> NATURE.COM 
For daily news updates see: 
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ESA’s funding shortfall is bad news for CarbonSat, a mission aiming to track atmospheric carbon dioxide. 


Space budget blow 
to climate science 


Economic difficulties take their toll on European Space 
Agency’s Earth-observation programme. 


BY EDWIN CARTLIDGE 


of last week’s European Space Agency 
(ESA) budget negotiations was better 
than expected, given the continent's economic 
troubles. But for Volker Liebig, ESA’s head of 
Earth observation, there is a sting in the agree- 
ment. The multi-year budget that member 


ee Europe's space chiefs, the outcome 


states approved — which falls some €2 bil- 
lion (US$2.6 billion) short of ESA’s proposed 
spending of about €12 billion — could force 
him to postpone or cancel a mission aimed at 
pinning down the mysterious carbon sinks 
that are slowing the rise of greenhouse gases 
in Earth’s atmosphere. 

Ahead of the budget negotiations in Naples, 
Italy, on 20-21 November, Liebig had hoped to 


secure around €1.25 billion for new research 
satellites. With France, Italy and Spain con- 
tributing much less than expected, he received 
€1.9 billion for Earth-observation projects. But 
€808 million has already been allocated for a 
new generation of weather-forecasting satel- 
lites, leaving him with little more than €1 bil- 
lion for research missions. “We have to discuss 
with scientists in the next few weeks what to 
do,” Liebig says. “But we will not be able to 
develop all the science satellites we wanted to” 

Most vulnerable, he says, is a planned 
€250-million climate-change mission sched- 
uled for launch in about 2018. One of the two 
contenders for the mission, CarbonSat, would 
map atmospheric concentrations of carbon 
dioxide and methane at high-enough resolu- 
tion to investigate a long-standing puzzle: why 
only about half of the CO, emitted by human 
activities remains in the atmosphere. Scientists 
assume that the rest is absorbed largely by the 
oceans and plants, but ground-based monitor- 
ing stations are too few and far apart to pin- 
point the sinks. 

Satellites could fill in the gaps in the picture, 
but in April ESA lost contact with Envisat, the 
one satellite providing such data (see Nature 
484, 423-424; 2012). Neither Japan’s exist- 
ing Greenhouse Gases Observing Satellite 
nor NASAs Orbiting Carbon Observatory-2 
(OCO-2), scheduled for launch in 2014, will 
map greenhouse-gas concentrations in as 
much detail as CarbonSat, which would survey 
the whole globe with a resolution of 4 square 
kilometres. “The information that it would 
collect is essential for developing, implement- 
ing, and monitoring greenhouse-gas-emission 
policies,” says atmospheric physicist David 
Crisp of NASAs Jet Propulsion Laboratory in 
Pasadena, California, who is the science team 
leader of OCO-2. “A timely launch of this sat- 
ellite should be among the highest priorities 
of ESA”. 

CarbonSat’s competitor for ESA funding, 
FLEX, would also help to pin down carbon 
sinks, by measuring the faint fluorescence 
generated by plants during photosynthesis — 
a measure of how efficiently they absorb car- 
bon. “The last thing we want to do is to destroy 

the forests or whatever 
is absorbing almost half 


For more from the of the CO, that we are 
ESA ministerial emitting,” says Crisp. 
meeting, see: “Wouldn't it be good to 


know where these 
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> processes are occurring?” 

However, there was better news for other 
ESA programmes. Europe's Ariane 5 rocket 
launcher, which is more expensive than com- 
petitors, was the focus of fraught discussions: 
Germany argued for a more powerful and ver- 
satile upgrade, whereas France maintained that 
it would be better to switch straight to a new 
and more economical launcher. Following late- 
night discussions, ministers decided to fund 
both designs over the next couple of years and 
to review progress in 2014. 

They also reached a deal on how to 
pay for Europe’s contribution to oper- 
ating the International Space Station 
between 2017 and 2020. The costs will 


be covered in kind by a German-backed 
plan to provide the propulsion and 
avionics for NASA’s Orion manned spacecraft. 
ESA also agreed to Russian involvement in its 
twin ExoMars missions, an ambitious pro- 
gramme of orbiters and landers scheduled for 
launch in 2016 and 2018. NASA pulled out of 
the project earlier this year. 

But ESA’s science programme faces a 
squeeze: it will receive €508 million a year 
for the five-year period from 2013 to 2017. 
Although slightly higher than its 2012 fund- 
ing of €480 million, thanks to the contribu- 
tions from new member states Poland and 
Romania, after inflation is taken into account 
this effectively amounts to a cut. Willy Benz 


of the University of Bern, chair of ESAs Space 
Science Advisory Committee, says that this 
could force the agency to delay a future large 
mission; cancel mission extensions for exist- 
ing probes; or cancel smaller missions. 

Benz thinks that the science programme got 
less than expected because in hard economic 
times spending is channelled towards activities 
that can more directly boost industry, such as 
designing and building new launchers. 

“I would say that the budget outcome was 
the best we could have hoped for given the eco- 
nomic circumstances,’ says Benz. “But if you 
cut budgets in the science programme, you cut 
science. There is only so much you can save by 
reducing travel or not making phone calls.” m 


Brazil has struggled to sustain its production of biofuel from sugar cane. 


Growth of ethanol 
fuel stalls in Brazil 


Shortages are a sobering lesson from a biofuels pioneer. 


BY CLAUDIO ANGELO IN BRASILIA 


C C new moment for mankind.” That was 
A™ Brazil’s former president, Luiz Ina- 
cio Lula da Silva, described his coun- 

try’s biofuel boom in March 2007. Back then, 
Brazil was the poster child of ethanol fuel, its 
output second only to that of the United States. 
Fermenting the sugars in the country’s abun- 
dant sugar cane produced a motor fuel that low- 
ered carbon dioxide emissions, and many saw 


Brazil as a model for how the world could shed 
its addiction to oil, creating jobs along the way. 

Five years on, Lula’s vision has tarnished. 
Biofuels are falling from grace around the 
world as critics charge that devoting millions 
of hectares of agricultural land to fuel crops 
is driving up food prices and that the climate 
benefits of biofuels are modest at best. But the 
fall has been hardest in Brazil, where govern- 
ment policies have compounded the effects of 
the global economic downturn. 
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Domestic consumption of liquid ethanol 
this year has been 26% lower than for the same 
period in 2008. Forty-one of the country’s 
roughly 400 sugar-cane ethanol plants have 
closed over that time. The price of pure ethanol 
at the pump is so high that in most states it is 
cheaper to fill up flexible-fuel cars with petrol 
blends that contain about 20% ethanol. The 
shift back to fossil fuels, combined with rapid 
growth in the number of cars on the roads 
(see ‘Fuelling Brazil's transport boom), has 
worsened city smog and caused emissions in 
the transport sector to spike at about 170 mil- 
lion tonnes of CO, in 2011, up from less than 
140 million tonnes in 2008. “We are increasing 
the world’s GDP: we are buying more oil and 
spending more on pollution-related health 
care,’ jokes Ildo Sauer, who studies energy 
policy at the University of Sao Paulo and is a 
former director of the state oil giant Petrobras. 

Brazil’s ethanol roller coaster is a sobering 
example of what can happen when climate and 
energy planning clash with economic decision- 
making. It began with the 2008 economic crisis, 
which staunched new investments in the sec- 
tor just as it was expanding rapidly, and deep in 
debt. Rather than developing new plantations, 
the industry fell back on harvesting cane from 
older, less-productive sites, and average yields 
plummeted from 115 tonnes per hectare in 2008 
to 69 tonnes this year. Combined with two bad 
harvests, this has forced Brazil to import some 
1.5 billion litres of maize (corn) ethanol from 
the United States over the past 2 years. 

But the killer blow came when the govern- 
ment decided to freeze the price of petrol and 
diesel to keep inflation under control, leaving 
biofuels less competitive. On the very night 
that current President Dilma Rousseff gave 
the closing speech of the Rio +20 confer- 
ence in June — the final agreement of which 
promised to phase out fossil-fuel subsidies — 
the government said it would be reducing a 
federal fuel tax to zero. “We have taken away 
jobs from agroindustry, stalled growth and 
worsened the air of our cities for the sake of 
inflation control,” says Luiz Horta, a bioenergy 


RICKEY ROGERS/REUTERS 


researcher at the Federal University of Itajuba. 

Meanwhile, the government has tried to 
stimulate the economy with tax breaks on 
the sale of new cars. That, combined with the 
cost of pure ethanol, has meant that “the share 
of alcohol in our transport fuel matrix has 
dropped from 55% in 2008 to 35%”, says André 
Ferreira, head of the Institute for Energy and 
the Environment, a think-tank in Sao Paulo. 

According to Antonio de Padua Rodrigues, 
technical director and acting president of 
UNICA, Brazil's sugar-cane industry associa- 
tion, the government knows that the situation 
is unsustainable. It has promised the industry 
that petrol prices will go up next year, and that 
the blend of ethanol will rise from 20% to 25%, 
the maximum allowed by law. But it will take 
time for the industry to bounce back from its 
poor fortune, and ethanol is likely to remain 
scarce and expensive for the next two years, 
say Rodrigues and Horta. 

Now, Brazil hopes to tap into a new biofuel 
source: second-generation ethanol, produced 
from the tough cellulose in plant stalks. Cel- 
lulose is difficult to break down and ferment, 
but several facilities in the United States are on 

the verge of making com- 


> NATURE.COM mercial cellulosic ethanol 
Read more in — for example, by using 
Nature'schemistry specialist enzymes to 
and energy Outlook. break down the long- 
go.nature.com/qGzodw chain cellulose molecules 


— and Brazil doesn't want to be left behind. 

In December last year, the Brazilian 
Development Bank launched a 1-billion-real 
(US$481-million) credit line to stimulate 
research and development in cellulosic biofu- 
els and other advanced sugar-cane technolo- 
gies. The Center for Sugarcane Technology, 
an industry-sponsored organization based 
in Sao Paulo, has taken up a 357-million-real 
loan to build a cellulosic ethanol plant next 
year, which would use waste plant matter 
from conventional sugar-cane fermentation. 
“We can double fuel yield per hectare when the 
technology is mature’, says Oswaldo Godoy, a 
project manager at the organization. 

The Brazilian Agricultural Research Com- 
pany (EMBRAPA) is also throwing its weight 
behind bioenergy. Its president, Mauricio 
Lopes, a geneticist who took office in October, 
has promised to build up research on biomass 
technology and double EMBRAPAs funding 
for that area, which today stands at a mod- 
est 24 million real per year. “I want to believe 
that the current state of the ethanol sector is a 
temporary blip,’ he says. Lopes says that Brazil 
will be “unbeatable” once cellulosic technology 
matures. “No other country has the logistics 
we have in place, or the number of different 
species we can derive ethanol from” 

But cellulosic ethanol won't be a quick fix, 
says Horta. “Nothing shall compete with con- 
ventional sugar-cane ethanol until 2050? = 


FUELLING BRAZIL'S TRANSPORT BOOM 


More vehicles and falling ethanol production is 
prompting a switch to petrol blends. 
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EPIDEMIOLOGY 


Daily dose of toxics to be tracked 


Exposome studies will tie environmental exposure to biological triggers of disease. 


BY EWEN CALLAWAY 


r | Mhink of it as a benevolent Big Brother. 
European researchers are gearing up to 
monitor thousands of people by giving 

them smartphones to record the chemicals to 

which they are exposed every day. 

Two projects this week announced that 
they had won a combined €17.3 million 
(US$22.4 million) from the European Com- 
mission to study the ‘exposome’ — the effects 
of environmental exposures on health. The 
researchers hope that the four-year studies 
will benefit public health in ways that genome 
research so far has not. 

Genome-wide association studies, in 
which scientists search for genetic variants 
linked to disease, have failed to fully explain 
why some people are more susceptible than 
others to chronic diseases, such as type 2 dia- 
betes. “There’s been too much emphasis on 
genetic factors, which contribute relatively 
little to disease compared with environmental 


factors,” says Martyn Smith, a toxicologist at 
the University of California, Berkeley, who is 
participating in the newly funded Exposom- 
ics project. Paolo Vineis, an environmental epi- 
demiologist at Imperial College London, leads 
the €8.7-million project. 

Subjects will carry smartphones equipped 
with sensors to measure exposures, and their 
blood will be analysed to monitor molecular 
changes. Most participants are already involved 
in other long-term health studies. One goal is to 
look for biomarker differences between people 
walking through areas with low air pollution 
and those exposed to urban fumes, in order to 
understand the triggers for conditions such as 
heart disease, asthma and lung cancer. 

Vineis’s exposomics approach has already 
uncovered gene-expression signatures that link 
people’s leukaemia risk with their exposure to 
heavy metals and other toxic chemicals, for 
example. 

The second project, the €8.6-million Human 
Early-Life Exposome, will focus on children 


and pregnant women. Children are more sus- 
ceptible to environmental influences because 
their bodies are smaller and their organs are 
still developing, says epidemiologist Mar- 
tine Vrijheid at the Centre for Research in 
Environmental Epidemiology in Barcelona, 
Spain, who heads the project. The research- 
ers will track disease biomarkers to assess the 
effects of environmental exposures on growth, 
obesity, immune development and asthma. 
Both projects will generate vast amounts of 
data, and Vineis and Vrijheid are developing 
data-sharing policies to enable other research- 
ers to mine the resource. 

Interest in exposomics is also growing in 
the United States. This year, the US National 
Research Council called for greater investment 
in exposome research, and the National Insti- 
tute for Environmental Health Sciences plans 
to make it a priority, although it has yet to invest 
in any projects as large as the European efforts, 
says the institute's David Balshaw. “We see this 
as a major priority,’ he says. m 
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Funding protest hits 
Bulgarian research agency 


Petition alleges that grant competition was mismanaged and cash channelled to bad science. 


BY ALISON ABBOTT 


ulgarian scientists have never had much 
B faith in their research ministry, but the 
outcome of this year’s grant competition 
has provoked a unprecedented storm of outrage. 

On 22 November, a detailed front-page 
report in the national newspaper SEGA pre- 
sented a hair-raising list of allegations, ranging 
from large funding allocations to companies 
and foundations with no experience in scien- 
tific research, to alleged conflicts of interest 
involving geologist Rangel Gjurov, who chairs 
the executive board of the Ministry of Educa- 
tion and Science’s Bulgarian National Science 
Fund (BNSF). Gjurov advocates an unproven 
‘corkscrew’ theory of earthquake prediction. 

A growing number of scientists are now 
alleging that the funding agency is funnelling 
research cash towards bad science, and unfairly 
favouring those with close ties to the agency. 
This week, more than 400 researchers sent a 
petition to the prime minister and key science 
policy-makers, demanding a reassessment of 
the competition. “We don't have any idea why 
some projects didn’t win, because we had no 
feedback from reviews,’ says materials scientist 
Victor Atanasov, from the University of Sofia, 
who helped to organize the petition and whose 
application for graphene research was rejected. 
“And we have no idea why some projects won 
because no rankings were provided.” 

The protest comes at a time when support for 
science is at a low ebb. Since 2009, university 
budgets have fallen by more than 20% and the 
budget for the Bulgarian Academy of Sciences, 
which runs 41 research institutes in the country, 
has dropped by more than 40%. “It is obviously 
important that the small amount of money we 
have is spent appropriately,’ says Atanasov. 

The disputed competition was announced 
in May by the BNSE offering grants totalling 
14.8 million leva (US$9.8 million). Projects 
within six priority areas — including material 
sciences and biotechnology — were invited 
to bid for up to half a million leva each, and 
95 were eventually selected from roughly 
1,200 applications. 

The petition says that two of the projects, 
awarded 678,000 leva between them, were 
supported only because of their connection 
with Gjurov. Many geologists are sceptical of 
Gjurov’s earthquake-prediction theory, which 


” 4 
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Emil Horozov quit as head of Bulgaria’s National 
Science Fund over concerns about corruption. 


links geological processes, phases of the Moon 
and climate change. Some of the winning pro- 
jects involve his recently qualified PhD student 
and a fellow advocate of the corkscrew theory. 
Gjurov did not respond to Nature’s questions 
about the allegations against him. 


SATELLITE CONCERNS 

The petition also raises concerns about a 
project to build a nanosatellite, which snagged 
480,000 leva for two companies, called Bul- 
cube and Space Research. Both were registered 
as new companies only on 16 July, the deadline 
for the call, and each declared just 50 leva in 
capital. The project’s principal investigator, 
Valery Golev, head of astronomy at the Uni- 
versity of Sofia, says that the satellite project 
will provide useful training to help Bulgaria 
develop its space activities, and that he believes 
the new companies to be competent. 

Ina written statement to Nature, a spokes- 
person for the ministry said that the large 
number of applications in the competition 
demonstrated that scientists trusted the BNSF. 
The ministry did not respond to the allegations 
that grants were awarded inappropriately, or 
explain their choice of grant-winners. 

The protest comes as former BNSF director 
Emil Horozov, who alleged corrupt practices in 
research funding two years ago, finds himself 
the target of minor mismanagement charges. 
Horozov, a mathematician, became BNSF 


director in January 2010 with a mandate to 
reform the agency. He and his team produced 
a report detailing irregularities involving more 
than half of the projects funded in 2008 and 
2009, which Horozov passed on to ministry 
officials in November 2010. But Horozov 
resigned in February 2011, believing that the 
report was being ignored (see Nature 472, 19; 
2011). “We are talking here of tens of millions 
of euros, perhaps even hundreds of millions,’ 
claims Horozov. “I want to make sure the bad 
practices are exposed and stopped” 

A year later, the Public Financial Inspection 
Agency (ADFI) in Sofia — which is investi- 
gating the allegations — told Horozov that its 
inquiry would include his tenure as director. 
The ADFI subsequently identified 30 pro- 
cedural misdemeanours, and blamed nine 
minor infractions on Horozoyv. 

Horozov denies any impropriety, and alleges 
that the charges are a smokescreen to cover up 
the ADFT’s failure to address much more seri- 
ous problems raised in his report. The ADFI 
says that its investigation followed proper pro- 
cedures, and its findings are publicly available. 

Horozov’s report alleged, for example, that 
roughly 2.4 million leva were channelled with- 
out proper review to projects involving four 
foundations, which offered logistical services 
for research into Black Sea resources and 
marine infrastructure. One of the projects was 
actually rejected in 2009, but then approved in 
a closed meeting of the BNSF’s executive board 
in 2011, which also threw in a top-up donation 
of 800,000 leva to another project. There has 
been no formal review of the projects’ progress. 

All four foundations are registered in the 
name of businessman Galin Dimitrov. He 
told Nature that he is president of the Founda- 
tion for Development and Implementation of 
Public Resources, but is only a donor to the 
others, adding that the foundations offer legal 
and finance expertise to scientists. His goal, he 
says, is to make “Bulgaria a better place to live”. 

Scientists, however, believe that Bulgaria's 
meagre funds are being channelled away from 
worthy research. “We don't believe that the 
reviewing process was fair,’ says microbiologist 
Pavlina Dolashka from the Bulgarian Academy 
of Science's Institute of Organic Chemistry in 
Sofia, whose application on vaccine develop- 
ment was rejected. “As some of the grants were 
very large, we need to protest.” m 
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BENDING FOR DATA 


Radio signals from Global Positioning System (GPS) satellites bend as they pass through the atmosphere; the 
amount of bending relates to the atmospheric temperature and moisture levels. Each of these ‘occultation 
events’, picked up by a receiver, results in hundreds of data points at different elevations in the air column. 


GPS satellite in 
medium Earth orbit. 


ATMOSPHERIC SCIENCE 


Radio-occultation 
receiver in low 
Earth orbit. 


Microsatellites aim to 
fill weather-data gap 


Commercial network would use radio-sounding system. 


BY ERIC HAND 


Most point down towards Earth. But the 
satellites of the Constellation Observing 
System for Meteorology, Ionosphere and Cli- 
mate (COSMIC) look sideways, across the 
curving horizon. There, dozens of satellites 
that are part of the Global Positioning System 
(GPS) pop in and out of view at the edge of 
the planet. By tracking their radio signals, 
COSMIC can provide atmospheric data that 
enhance weather forecasts and climate models. 
But the fleet, launched six years ago at a 
cost of US$100 million, is nearing the end 
of its life, with one satellite of the original six 
already defunct. At a three-day workshop 


2 


MORE 
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ome orbiting satellites look up at the stars. 


TOP STORY 


last month at the University Corporation for 
Atmospheric Research (UCAR) in Boulder, 
Colorado, researchers hailed the US-Taiwanese 
COSMIC asa pioneer and discussed plans for a 
commercial successor: a network of 24 micro- 
satellites dubbed the Community Initiative for 
Cellular Earth Remote Observation (CICERO). 
Researchers say that the programme could help 
to address a gap in atmospheric data as the 
United States struggles to meet a 2016 launch 
date for the first spacecraft in its expensive 
Joint Polar Satellite System (JPSS). The radio- 
sounding technique that both COSMIC and 
CICERO use is a “disruptive technology’, says 
Rick Anthes, a COSMIC scientist and former 
president of UCAR. “The impact is huge — 
especially the impact for the cost.” 


| MORE NEWS | 
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GPS radio signals, picked up by Earth-bound 
receivers in everything from mobile phones to 
missiles, yield precise position information. 
But COSMIC puts them to a different use. The 
signals travel at a known rate, but skimming 
through the planet’s atmosphere and back out 
to space bends the signals and delays them; 
COSMIC uses the length of the delay to meas- 
ure the atmospheric density, which can provide 
information on changing characteristics such 
as temperature and moisture levels (see ‘Bend- 
ing for data). It makes many hundreds of these 
radio-occultation measurements each day. 

Radio occultation has been a tool for plan- 
etary science from the early days of the space 
programme. In 1965, NASA’s Mariner 4 craft 
broadcast radio signals to Earth across the edge 
of Mars, yielding estimates of the pressure of 
the red planet’s thin atmosphere. A decade 
later, the Voyager probes used the technique 
to sample the atmospheres of Jupiter, Saturn, 
Uranus and Neptune. 


COSMIC SUCCESS 

But for the technique to be useful on Earth 
— where basic atmospheric characteristics 
are already well known — a much denser net- 
work of transmitters and receivers is required. 
The GPS satellites offer a convenient set of 
transmitters, but for more than a decade the 
necessary receivers remained elusive. Anthes 
tried to gain US government support to launch 
COSMIC, but eventually gave up; he found a 
receptive audience in the then-nascent Tai- 
wanese space programme. “It was a struggle 
but we did it,” he says. 

Since 2006, the small mission has demon- 
strated its many advantages. Satellites that 
rely on infrared sounders cannot see through 
clouds; microwave sounders can be con- 
founded by intense moisture; but the radio 
waves seen by COSMIC pass unimpeded 
through even the worst storms. Furthermore, 
radio-occultation measurements depend 
only on the arrival of a signal, which can be 
timed using atomic clocks and easily com- 
pared between satellite systems and observing 
campaigns. That has made radio occultation 
an important calibration tool for climate sci- 
entists. 

Gottfried Kirchengast, who works on radio 
occultation at the University of Graz in Aus- 
tria, combined a decade’s worth of data from 
COSMIC and other sources and found that 


@ Revolution’s aftershocks still rattling — 4 
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com/ovzrsy 
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atmospheric temperature was rising — evidence 
of anthropogenic climate change (B. C. Lackner 
et al. J. Climate. 24, 5275-5291; 2011). “Radio 
occultation allowed the detection ofa trend in 
the shortest time period ever,’ he says. 

Now that COSMIC has lost one satellite, says 
Bill Kuo, director of the programme, “We're liv- 
ing on borrowed life, so to speak.” But radio- 
occultation data will continue to flow from the 
European Meteorological Operational satellite 
programme (MetOp), which launched a sec- 
ond satellite in September and is now provid- 
ing about 1,400 soundings per day (see ‘A global 
view ). For their part, COSMIC team members 
hope for a successor mission, COSMIC-2, 
which in 2016 would launch six satellites to 
orbit a narrow section of the tropics, gathering 
data that would reduce uncertainty in measure- 
ments of hurricane intensities by 25%, and in 
those of hurricane tracks by 25-50%. 

But CICERO has advantages over both of 
these programmes. Tom Yunck, the founder 
of GeoOptics in Pasadena, California, which 
is developing the network, says that CICERO’s 
fleet would observe many more occultations 
than COSMIC, because each member would 
track not only the 32 GPS satellites, but also the 
24 spacecraft in the Russian Global Navigation 
Satellite System and the 30 satellites that will 
comprise Europe's Galileo system by the end of 
the decade. In total, Yunck expects CICERO to 
generate an unprecedented 30,000 soundings 
per day or more. 

That could help to overcome a drawback of 
radio-occultation data. Conventional weather 
satellites can offer precise horizontal resolution 
relative to a particular spot on Earth’s surface, 
whereas radio soundings provide high-resolu- 
tion measurements vertically through the air 
column, with coarse horizontal resolution. To 
make up for that, CICERO will surround Earth 
with sensors that allow for observations along 
crossing sightlines, to sharpen the horizontal 
resolution of the measurements, and hence 
boost their value to forecasters. 

“If you look at the impact per observation, 
radio occultation scores quite highly,” says 
Sean Healy, a scientist in the satellite division 
of the European Centre for Medium-Range 
Weather Forecasts in Reading, UK. “There 
are clear arguments for trying to increase the 
number of radio-occultation data available.” 


PUBLIC-PRIVATE PARTNERSHIP 

Yunck also hopes to transform the econom- 
ics of weather data. Whereas most weather 
data are acquired with satellites paid for and 
launched by publicly funded agencies such as 
the US National Oceanic and Atmospheric 
Administration (NOAA), Yunck would build 
and launch CICERO with private funds, and 
then license the data to agencies in the United 
States and, potentially, elsewhere. Research- 
ers would have free access to the data through 
the government, just as they do now. Yunck 
says that GeoOptics has raised $4 million of 


A GLOBAL VIEW 


IN FOCUS | NEWS 


Over the course of just six hours, a handful of satellites can observe radio signals as they penetrate the 
atmosphere at multiple locations over land and sea, providing data that blanket the globe. 
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the $14 million that it needs to launch a two- 
satellite pilot project in mid-2014. That could 
expand to 24 satellites within 18 months; 
Yunck estimates that the total cost of build- 
ing and launching the constellation would be 
about $150 million. 


The COSMIC radio-sounding satellites are ageing 
but may set the stage for a commercial system. 


He hopes to prove the concept by launching 
CICERO before NOAA’ JPSS, a $12.9-billion 
programme of five weather satellites equipped 
with microwave and infrared sensors. The 
JPSS has exceeded its planned budget, and 
in July 2012, an independent review labelled 
its oversight “dysfunctional” and said that the 
JPSS’s delays and exorbitant costs — which the 
reviewers could not understand — were affect- 
ing NOAA’ credibility. 

Each JPSS satellite will weigh about 
2,500 kilograms. By contrast, a CICERO sat- 
ellite — little more than a high-fidelity receiver 
— would weigh just 85 kilograms and could 
piggyback as a secondary payload on another 
launch. Yunck acknowledges that the JPSS 


»COSMIC-6 »METOP-A *METOP-B »GRACE-A 


will do things CICERO never could, such as 
imaging and ozone mapping. But he says that 
CICERO would measure temperature more 
precisely, and certainly more cheaply. JPSS 
costs “have gone through the roof, and they’re 
getting nowhere’, says Yunck. 

In a statement, NOAA spokesman John 
Leslie said that, in spite of its challenges, the 
JPSS programme has “delivered impressive 
outcomes’, including the 2011 launch of a 
pilot craft, the Suomi National Polar-orbiting 
Partnership satellite. He adds that the agency 
“strongly supports” collecting data that will 
continue COSMIC’s efforts; NOAA is open to 
buying the data commercially, but is not con- 
vinced that commercial sources of data will 
become available in the short term. 

Yunck would like to change that impres- 
sion. If the CICERO model succeeds, it 
might usher in a slew of other commercial 
satellite networks that could provide a range 
of data, including greenhouse-gas observa- 
tions and gravity measurements, says Conrad 
Lautenbacher, chief executive of GeoOptics 
and a former head of NOAA. He adds, “I 
would like to put the government out of the 
business of doing routine measurements and 
observations that could easily be done by a 
commercial company.’ m 


CORRECTIONS 

In the News story ‘Hunt for life under 
Antarctic ice heats up’ (Nature 491, 
506-507; 2012), we wrongly stated that 
Martin Siegert was based at the University 
of Edinburgh. He is in fact based at the 
University of Bristol, UK. And the Editorial 
‘America’s carbon compromise’ (Nature 
491, 301; 2012) should have talked about a 
$20 tax per tonne of carbon dioxide not per 
tonne of carbon. 
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Jiangxi province, 


Inthis special issue, Nature examines the end of the 1997 
Kyoto climate treaty — and the path ahead. 


emitting greenhouse gases with abandon. 

The pollution-reduction commitments made 
by 37 nations as part of the Kyoto Protocol will 
expire, leaving the planet without any interna- 
tional climate regulation. 

In practice, the 1997 treaty did little to curb emis- 
sions of greenhouse gases (see page 656). Most of 
the parties to the treaty met their commitments 
easily but, because the Kyoto Protocol did not set 
limits for developing countries, the total emis- 
sions of greenhouse gases are rising faster than 
ever, thanks mainly to massive growth in coal con- 
sumption by China. A graphic view of the world’s 
energy resources shows just how difficult it will be 
to wean the planet off fossil fuels (see page 654). 
Many nations are acknowledging the inevitable and 
scrambling to gird themselves against stronger and 
more frequent floods, droughts, heat waves and 
other climate threats (see page 659). 

But Kyoto has provided valuable lessons. As 


0 n 1 January 2013, the world can go back to 


nations press forward towards a new climate 
treaty in 2015, they should focus on controlling 
the carbon that each country consumes, both at 
home and through imports, rather than the car- 
bon pollution that they emit, argues energy-policy 
researcher Dieter Helm (see page 663). They can 
also build on one legacy of the Kyoto Protocol: the 
carbon cap-and-trade systems and carbon taxes 
that have emerged in Europe, Australia, Japan, 
China, California and parts of Canada. Expand- 
ing these mechanisms to cover a bigger share of 
the world will be crucial for solving the carbon 
problem, says climate-policy researcher Michael 
Grubb (see page 666). Rather than a dead end, 
Kyoto could prove to be a first step towards an 
eventual solution. m 
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é At The legacy of a climate treaty 
1 1 nature.com/kyoto 
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More than ever, 
nations are powering 
themselves from 
abundant supplies 

of fossil fuels. 


F ven though countries are burning unprecedented 
amounts of oil and gas, the estimates of how much is 
left continue to grow, thanks to high prices and new tech- 
nologies that have enabled companies to find and extract 
new resources. A decade ago, it was the tar sands of Can- 
ada and Venezuela. More recently, hydraulic-fracturing 
technologies have opened up oil and gas resources in the 
United States. Across the globe, proven oil and gas reserves 
are 60% higher today than they were in 1991. At current 
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consumption rates, those reserves would last for about 
60 years — and that could be extended by new discover- 
ies and unconventional deposits. Coal reserves have not 
increased in size, but the supply will last for at least a cen- 
tury at current rates of consumption. 

Renewables such as solar and wind power are grow- 
ing faster than any other source of energy, but are barely 
making a dent in fossil-fuel consumption. The scale of 
the challenge will only grow as the expanding global 
population requires more energy. This tour of global 
and regional energy trends makes clear that even with 
aggressive action to reduce energy consumption and 
curb emissions, fossil fuels will be around for a very 
long time. = 
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In 2011, the globe 
consumed the equivalent 
of 12,275 million tonnes 
of oil. Figures for the top 
50 nations show how 
important fossil fuels 
remain; they supplied 
87% of the world’s 
energy. 
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ALL THE COAL IN CHINA US BONANZA 

Mainland China now accounts for half of global coal Hydraulic fracturing of tight rocks in the United States is driving a 
consumption but at current consumption rates, it only has surge in oil and gas production that would continue for at least a 

33 years of domestic coal left. decade in one scenario explored by the International Energy Agency. 
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fter 8 days of fractious negotiating, delegates at the 1997 
climate conference in Kyoto, Japan, were running out of 
time to deliver a treaty aimed at slowing global warming. 
The leader of the talks, Michael Zammit Cutajar of Malta, 
took the unusual step of invoking Zen Buddhism, telling 
everyone that they must break through mental barriers to 
achieve enlightenment. Two days later, after a marathon 
all-night session, the negotiators finally hammered out the 
climate agreement known as the Kyoto Protocol. It was the first — and 
so far, only — pact to commit rich countries to reducing emissions of 
carbon dioxide and other greenhouse gases. 

But even before the ink was dry on the agreement, it was clear that the 
protocol faced a rocky future. Although the United States had signed the 
treaty, President Bill Clinton signalled that the world’s largest economy 
would not ratify the pact unless China and other developing nations 
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Commitments made under the 
Kyoto climate treaty expire at the 
end of 2012, but emissions are 

rising faster than ever. 


agreed to limit their emissions, something that they had objected to 
doing before the developed world acted. By the time the Kyoto Protocol 
came into force in February 2005, the United States had pulled out. The 
remaining signatories — 37 developed nations and economies in tran- 
sition — pledged to reduce their greenhouse-gas emissions from 1990 
levels by an average of 4.2% in the period from 2008 to 2012. 

As that window closes, the countries that stuck with the treaty can 
claim some success. Overall, they met their target with room to spare, 
cutting their collective emissions by around 16%. But most of those cuts 
came with little or no effort, because of the collapse of greenhouse-gas 
producing industries in eastern Europe and, more recently, the global 
economic crisis. 

Furthermore, the cuts by industrialized nations have done little to 
combat the global problem. Worldwide emissions have surged by 50% 
since 1990, driven by economic growth in China and other parts of Asia, 
South America and Africa. In the 1990 base year, developed nations 
including the United States accounted for two-thirds of global emis- 
sions. Now, their contribution has dropped below 50%. 

“Kyoto had a very limited impact on climate,’ says Atte Korhola, an 
environmental-policy researcher at the University of Helsinki. “It was 
too narrow in ambition, its tools were too massively bureaucratic and 
it offered too many 
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The legacy of a climate treaty taught policy-makers 
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and possibly laid the groundwork for more ambitious efforts. “Kyoto was a 
grand policy experiment with important lessons we ought to take forward. 
Ithad its flaws — no wonder, you rarely get policies right the first time — 
but the overall architecture is still useful? says Roger Pielke Jr, who studies 
energy and innovation policy at the University of Colorado Boulder. 


DIFFICULT LEGACY 

The seeds of Kyoto problems were planted long before the treaty took 
shape. Many go back to June 1992, when negotiators at the Earth Sum- 
mit in Rio de Janeiro, Brazil, were hammering out the United Nations 
Framework Convention on Climate Change (UNFCCC), the umbrella 
treaty that would encompass the Kyoto Protocol. Negotiators in Rio were 
still crafting the document just hours before 
heads of state arrived to sign it. Pressed by time 
and mounting expectations, the delegates bor- 
rowed heavily from past treaties, including a 
US-Soviet nuclear-arms agreement and the 
1989 Montreal Protocol designed to protect 
the ozone layer, says Gwyn Prins, who studies 
environmental politics at the London School of 
Economics and acted as an adviser for the Brit- 
ish negotiating team in 1992. 

“Take out nuclear warheads, put in CO, — 
the basic idea was as easy as that,’ says Prins. “But it turned out that 
climate change is a much more wicked beast — scientifically and eco- 
nomically — than ozone chemistry or nuclear-arms control” 

A meeting in Berlin in 1995 created another major problem, when 
parties to the UNFCCC decided to divide the world into two categories 
for the future treaty. There would be a set of rich countries with ambi- 
tious climate responsibilities and a set of less-developed economies — 
including China — with no responsibilities. 

That decision, part of an agreement known as the Berlin Mandate, did 
not sit well with US politicians. In the summer of 1997, Robert Byrd, a 
Democratic senator from West Virginia and one of the senior politicians 
of his day, declared: “It is the Berlin mandate — and the fact that it lets 
the developing world off the hook scot-free — that will seriously harm 
the global environment in future years.” 

His colleagues agreed. The US Senate voted 95 to 0 in favour of a 
proposal demanding that developing nations participate in emissions 
commitments. Because Kyoto included no such commitments, the 
United States — the world’s largest greenhouse-gas emitter at the time 
— would not ratify it. 

The industrialized countries that remained with the treaty were each 
bound by individualized commitments, based on the state of their econ- 
omy and energy mix at the time (see ‘Uneven progress’). The devel- 
oped nations of Germany and Denmark agreed to cut their emissions 
by 21% relative to 1990 levels, whereas Portugal, with its less-developed 
economy, was allowed to increase its emissions by 27%. 

Kyoto covered four main greenhouse gases — CO,, methane, nitrous 
oxide and sulphur hexafluoride — and two further groups of gases, 
hydrofluorocarbons and perfluorocarbons. But it did not include 
another warming force: black soot particles from the incomplete com- 
bustion of wood and fossil fuels. 

Countries could meet their commitments by cutting their own 
emissions or by buying emission allotments from other nations that 
had exceeded their required reductions. Rich countries could also get 
credit by investing in low-carbon technologies in developing countries. 

For most central and eastern European nations, the job was easy: 
industrial emissions were high in the base year but had plummeted even 
by the time the treaty was signed. By 2010, Russia's CO, emissions were 
34% lower than in the base year (excluding cuts attributable to land-use 
changes) and Ukraine's had fallen by 59%. The United Kingdom also 
easily met its 12.5% reduction target, thanks to the closure of many coal 
mines and a corresponding drop in consumption. 

More recently, the economic downturn has helped to reduce 
emissions. Economists estimate that between 2007 and 2008, decreased 
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energy use caused a 2% drop in the emissions of the Kyoto Protocol 
countries; and that trend has continued as economies have sputtered. 

But the reductions made under the treaty were dwarfed by the rise in 
emissions not covered by the accord, especially in Asia. Since 2000, CO, 
emissions in China have nearly tripled to almost 10 billion tonnes, and 
those in India have doubled to around 2 billion tonnes. 

The rise in Asian emissions is partly a result of the migration of heavy 
industry from developed nations to developing countries, which make 
products that then get shipped back to wealthy nations. Between 1990 
and 2010, the emissions embodied in such products grew by an average 
of 10% per year — to an annual total of 1.4 billion tonnes — surpassing 
the total emissions reductions achieved under Kyoto, says Glen Peters, 


“KYOTO WAS A GRAND EXPERIMENT 
WITH IMPORTANT LESSONS WE OUGHT 
TO TAKE FORWARD.” 


a climate-policy researcher at the Center for International Climate and 
Environmental Research — Oslo. The gains made by the treaty were 
therefore deceptive, says David Victor, an energy-policy researcher at 
the University of California, San Diego. The treaty, he adds, was based 
on “dubious economic assumptions and flawed accounting systems”. 


FAULTY REASONING 

One of those dubious assumptions was that fossil fuels would soon 
grow scarcer and prices would spiral upwards, helping to push coun- 
tries towards alternative energy sources. But the globe is currently going 
through a massive coal renaissance, driven by abundant supplies that 
have grown much cheaper relative to other fuels in much of the world: 
the share of energy derived from coal has increased in the past ten years 
in both developing and developed countries. There has even been a shift 
towards coal in some parts of Europe, despite the mandatory cap-and- 
trade system to limit emissions. As a result, global energy production 
has grown more carbon-intensive in the past decade. 

“The fathers of the UNFCCC and Kyoto Protocol quite severely 
underestimated the amount of hydrocarbons buried in the ground,” 
says Ottmar Edenhofer, chief economist at the Potsdam Institute of 
Climate Impact Research in Germany and a lead scientist with the Inter- 
governmental Panel on Climate Change. 

These trends in energy use have made it nearly impossible for 
countries to limit global warming to less than 2 °C above preindustrial 
levels, the value chosen by the EU as a threshold likely to prevent dan- 
gerous climate change. Calculations suggest’ that emissions of CO, must 
stay below 1,000 billion tonnes between 2000 and 2050 to give the world 
a 75% chance of containing the temperature rise to 2°C. 

But emissions from fossil-fuel burning and deforestation since 2000 
have already pumped more than 450 billion tonnes of CO, into the 
atmosphere. If the current trend continues, the 1,000-billion-tonne 
margin will be surpassed in a little more than a decade. 

Despite its shortcomings, Kyoto has not been an utter failure, says 
Robert Stavins, an environmental economist at Harvard University in 
Cambridge, Massachusetts. Rather than judging the agreement on the 
emissions reductions it has achieved, he says, people should consider 
whether it has put the world on the right path. 

“Nobody with a right mind could have expected that a climate regime 
that treats China like sub-Saharan Africa and that excludes 50 develop- 
ing countries with a higher per-capita income than Romania could be 
anything other than a cautious first step,” he says. “What we need to 
create is a workable successor with binding national emission targets 
that all governments can be realistically expected to adopt.” 
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UNEVEN PROGRESS 


The nations with binding limits under the Kyoto Protocol reduced their overall greenhouse-gas 


Kyoto target: figure shown [J Emissions 
after country name 


emissions by 16% from 1990 levels, but many are likely to miss their individual targets. 
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Kyoto will leave a valuable legacy, says Yvo de Boer, former chief 
executive secretary of the UNFCCC and adviser for global auditing 
firm KPMG. The methodologies developed for reporting and verify- 
ing national greenhouse-gas emissions and land-use changes will be 
important components of any future climate treaty, he says. 

The protocol also gave birth to a method for trading carbon emissions 
among countries that face limits. Pioneered by the EU’s Emissions Trad- 
ing Scheme, which launched in 2005, this carbon market could one 
day become a globally linked CO, cap-and-trade system, says de Boer. 

Anadditional element of the Kyoto agreement — the Clean Develop- 
ment Mechanism (CDM) — established a way for rich countries to get 
credits towards their targets by making cost-effective emissions cuts in 
poor countries. Critics have charged that the CDM is plagued by cum- 
bersome bureaucracy and that some Western-funded clean-technology 
projects in developing countries would probably have been built without 
it. Nevertheless, a total of 5,000 CDM projects have attracted invest- 
ments worth almost US$100 billion. The projects have ranged from 
providing rural Chinese villagers with solar cookers to supporting a 
100-megawatt wind farm in Mexico. 

“Without Kyoto we wouldn't have achieved anything at all” in that 
area, says Victor. He would like to see a successor treaty constructed 
more like trade accords, which are tailored using realistic assumptions 
about commitments and rely on mutual action. “What one country is 
willing to pay to control emissions depends a lot on what its economic 
competitors will pay as well? he says. “More flexible treaties could help 
countries craft deals that are truly interdependent — where the efforts 
of one country get multiplied because they lead others to do more.” 


FOLLOW THE MONEY 

Many other policy experts agree that the next climate treaty must take 
a more pragmatic approach than the UNFCCC and the Kyoto Proto- 
col, which failed to win over the biggest polluters in part because it 
relied on a mix of ethical and environmental rationales rather than 
economic ones. “Making energy more expensive is a political liability 
everywhere,” says Pielke. “When emission reductions run up against 
economic growth, economic growth will inevitably win out. There is 
no magical solution, so you better set yourself tangible goals that aren't 
doomed to clash with the iron laws of politics.” 

Emissions targets for all countries should be allocated in a way that 
acknowledges the political and economic costs of complying with a 
climate agreement, argued Valentina Bosetti, a climate-impact modeller 
at the Eni Enrico Mattei Foundation in Milan, Italy, and Jeffrey Frankel, an 
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economist at Harvard, in a discussion paper last year”. China, for example, 
would be asked to accept only targets that it could meet without sacrificing 
its developmental aspirations; the United States would be assigned more 
stringent goals. But with time, all nations’ emissions targets would be 
adjusted progressively according to a common economic formula. 

Attaching a price to carbon, through cap-and-trade mechanisms or a 
direct carbon tax, would help by stimulating technological advances that 
reduce emissions. The challenge, says Pielke, is to get the price right and 
make sure that the revenue will go towards investments in technology. 

A moderate carbon tax — applied when fossil fuels are removed from 
the ground — might work best to stimulate innovation in technologies 
that will eventually make alternative energy sources cheaper than fossil 
fuels, he says. But the approach has to be global. 

Ina policy paper’ published in 2010, Pielke, Prins and 12 others called 
for a more pragmatic, diversified and less bureaucratic approach than 
Kyoto, which would wean the global economy off carbon as a by-product 
of reducing poverty and expanding energy access to the poor. 

The group takes the focus off CO,, which has a long lifetime in the 
atmosphere, and instead emphasizes cuts in black carbon and methane 
emissions, which dont last as long. This, say the paper's authors, would 
slow global warming more quickly and would provide time for a transi- 
tion to a low-carbon economy. They also suggest that negotiations for 
the next emissions treaty avoid topics such as deforestation, land use, air 
quality and adaptation, which would greatly complicate its architecture. 

That agreement will take shape slowly over the next few years. In 
Copenhagen in 2009, nations failed to produce a follow-on treaty to 
the Kyoto Protocol. However, in Durban, South Africa, last year, coun- 
tries including China and the United States agreed to negotiate a new 
climate treaty by 2015. If the past is any indication, the final details of 
that pact will not emerge until the sleep-deprived delegates have reached 
the deadline of the final negotiating session. 

Will the world find a solution to this so far intractable problem? “Pm 
confident it will” says de Boer, who presided over the unsuccessful negoti- 
ations in Copenhagen. “But I'm not convinced that it will come on time? = 


Quirin Schiermeier is a reporter for Nature in Munich, Germany. 
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Bangladeshis use the ubiquitous hyacinth weed to build floating, flood-proof crop gardens. 


NO GOING BACK 


With nations doing little to slow climate change, many people 
are ramping up plans to adapt to the inevitable. 


hen Superstorm Sandy hit the US 

coast last month, it blew millions of 

New Yorkers back into the nineteenth century. The southern 

part of Manhattan went black after floodwaters shorted out 

electrical systems. With the subway system disabled, many 

residents resorted to traversing the island by foot, and water 

supplies in some areas became contaminated with bacteria and pollutants. 
The largest Atlantic hurricane on record, Sandy wreaked US$50 bil- 
lion in economic losses along the US northeast coast, providing a costly 
reminder of how ill-prepared even the richest nations are for weather 
extremes. Some recent weather disasters have now been attributed, at least 
in part, to human activity, including the 2003 European heatwave’ and the 
floods in England in 2000 (ref. 2). According to the Intergovernmental 
Panel on Climate Change (IPCC), storms, floods and droughts will strike 
more frequently and with greater strength as the climate warms’. And if 


BY OLIVE HEFFERNAN 


business as usual and deal with the mess later. 
But greenhouse-gas emissions are increasing at 
an unprecedented rate and countries have failed to negotiate a successor 
to the Kyoto Protocol climate treaty. That stark reality has forced climate 
researchers and policy-makers to explore ways to weather some of the 
inevitable changes. 

“As progress to reduce emissions has slowed in most countries, there 
has been a turn towards adaptation,’ says Jon Barnett, a political geog- 
rapher at the University of Melbourne in Australia. 

Adaptation has tended to focus on hard defences, such as shoring up sea 
walls and building dams. But as awareness of adaptation has grown, so too 
has the concept. “Adaptation means different things to different people, 
and is extremely location specific,’ says Neil Adger, an environmental 
and economic geographer at the University of Exeter, UK. Although 
residents in Bangladesh can raise their houses on stilts to survive floods, 
some settlements in Alaska and the Maldives 


manage in a warmer, harsher future? 

Just a decade ago, ‘adaptation was some- 
thing ofa dirty word in the climate arena — an 
insinuation that nations could continue with 


nations are struggling to cope now, how will they 


AFTER KYOTO 


The legacy of a climate treaty 
nature.com/kyoto 


must move in the face of rising sea levels. 
Increasingly, it is local people who are 

deciding how to make their communities more 

resilient — and that is increasing the chances 
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of success. “A solely top-down approach to adaptation — focusing on 
heavy investment in engineering and infrastructure — will not work as 
it is expensive and impractical,” says Robert Lempert, who researches 
decision-making at the RAND corporation, a think tank in Santa 
Monica, California. 


FEARSOME FLOODS 

With its low-lying deltas, Bangladesh has always been threatened by 
storms that blow in from the Bay of Bengal. But the cyclone that hit the 
southeastern edge of the nation on 29 April 1991 caused massive flood- 
ing and killed some 138,000 people, mainly children and older women. 
Twenty-one years earlier, a cyclone had claimed up to a half million lives. 

Rising sea levels are increasing the risks to Bengalis, both from 
cyclones and from the spread of saline groundwater, which ruins aquifers 
and kills off crops. Projections by the IPCC indicate that a sea-level rise of 
1 metre — expected sometime within a century — would inundate up to 
20% of the country’s land areaand displace 14% of the total population’. 

In response, the country is busy building and strengthening its 
13,000-kilometre network of embankments, planting salt-resilient crops 
and storing fresh water. One project, supported by the United Nations 
Development Programme, taught 18,000 households in coastal com- 
munities to plant mangroves and fruit trees and to harvest rainwater by 
digging ditches. The project aims to provide fresh water and income 
as well as to protect against flooding and erosion. In 2005, the country 
became one of the first to complete a national adaptation programme of 
action, and it later established a climate-change trust fund for financing 
local, small-scale projects in adaptation and mitigation. 

Although Bangladesh’s per capita income ranks the nation among 
the world’s poorest, the country has mounted such a strong response 
to climate change that it has become something of a model for other 
nations, says Saleemul Hug, a Bangladeshi scientist and senior fellow 
in the Climate Change Group at the International Institute for Environ- 
ment and Development in London. “In the past few years, the level of 


"A SOLELY TOP-DOWN APPROACH TO 
ADAPTATION — FOCUSING ON HEAVY 
INVESTMENT IN ENGINEERING AND 
INFRASTRUCTURE — WILL NOT WORK.” 


public awareness about climate change in Bangladesh has skyrocketed 
and it is now probably higher there than in any other country,’ Huq says. 
But here and in other developing countries, it is hard to separate adapta- 
tion efforts from development that would have happened anyway. This 
confusion permeates discussions on financing adaptation efforts; of the 
$125 million that Bangladesh has received in climate funds from over- 
seas, it remains unclear how much is in addition to development aid. 

And like almost everywhere, Bangladesh is having to play catch-up 
with the climate. “There is an adaptation deficit out there right now to 
current climate variability,” says Kristie Ebi, an expert on climate and 
health impacts at Stanford University in California. 

Still, many experts say that Bangladesh has made significant progress. 
“What we can say is that there is collective action in the most climate- 
vulnerable country in the world,” says Huq. “The issue has galvanized 
people even across the political spectrum.” 

Whereas Bangladesh has focused on involving citizens in many small- 
scale projects, the city of Melbourne has sought to head off problems 
through a massive engineering venture. Over the past decade, southern 
Australia has been hit by the worst drought in a century. After water 
restrictions implemented in 2007 angered thousands of farmers, the state 
of Victoria announced it would invest Aus$3.5 billion (US$2.9 billion) 
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in a new desalination plant at the site of Wonthaggi and commissioned 
a pipeline to bring water to the region from a river in the north. Overdue 
and over budget, the desalination project will take decades to pay off and 
is eating up the region’s adaptation resources, argue critics. 

It is a clear case of maladaptation and will increase overall vulnerability 
to climate change, says Barnett, who has studied the project”. By investing 
so heavily in the desalination plant, the Victorian government has effec- 
tively shut off the possibility of funding other adaptation options, such 
as harvesting rainfall and recycling domestic waste water from showers 
and dishwashing, which would be cheaper and more effective, he says. 

But John Thwaites, the water minister of Victoria until mid-2007, 
says that Melbourne has taken a multi-pronged approach that includes 
conservation and harvesting storm water. The government originally 
preferred measures other than desalination, but decided to build the 
plant in response to the unprecedented water shortage. 

The United Kingdom has taken a different approach to planning 
for water problems. In autumn 2000, rainfall was at its highest since 
records began in 1766, causing devastating floods that left the village of 
Hambledon under water for six weeks at a cost of more than £1 billion 
(US$1.6 billion). Early this year, a long drought led to water restrictions 
in parts of Britain, but the spring and summer that followed were the 
wettest in a century. Climate projections indicate that the country will 
face more frequent droughts and floods, but the models do not agree 
on the extent and timing of the changes. 

Given the uncertain predictions, UK water companies are adopting a 
more flexible approach for making decisions about building reservoirs, 
extracting groundwater and other water-related plans. Rather than 
developing one strategy and sticking with it, they plan incrementally 
and frequently re-evaluate on the basis of new information, says Nigel 
Arnell, director of the climate research-focused Walker Institute at the 
University of Reading, UK. 

Communities living on tiny islands, however, don't have the luxury of 
considering many different options and reevaluating plans. For people 
in the Maldives, Kiribati and Carteret, there is 
simply nowhere to retreat when rising sea lev- 
els infiltrate their drinking-water supplies and 

ood their homes, so they will have to flee. 

Small island nations can learn valuable les- 
sons from the Alaskan village of Newtok, which 
is already in the process of moving. Located 
on the Ninglick River next to the Bering Sea, 
Newtok is below sea level and losing ground at 
arate of roughly 22 metres per year to erosion. 
The villagers selected a site for their new home 
on Nelson Island, which is 15 kilometres away. 
Turning a hardship into an opportunity, the residents are learning build- 
ing skills to construct sustainable homes in their new village, and that 
will give them more job options in the future, says Robin Bronen, head 
of the Alaska Immigration Justice Project in Anchorage, which works 
with communities being relocated as a result of climate change in Alaska. 

The experience in Newtok serves as a model for how ‘climigration’ 
should work in practice, says Bronen. “That’s because of the process — 
the community has made all of the decisions.” 

Heat is often considered less dangerous than floods, but some of the 
most serious consequences of climate change will arise from hot weather. 
During the 2003 heatwave in Europe, temperatures reached their highest 
since 1500, topping 40°C for seven days in some parts of France. Almost 
15,000 people died in that country alone. One climate modelling study of 
the Mediterranean region found that by the end of the current century, 
the frequency of heatwaves will increase from one every 3-5 summers 
to 2-3 heatwaves each summer; and they will last 2-5 times longer’. 

In the aftermath of the 2003 disaster, France established a heatwave 
warning system, one of 12 now in operation throughout Europe’. In 
France, alerts are triggered when the five-day weather forecast predicts 
that temperatures will exceed thresholds for three days. In addition, sci- 
entists analyse mortality data, hospital patient loads and drinking-water 
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The costly Wonthaggi Desalination Plant in Australia has been criticized for consuming adaptation resources. 


supplies. The system can issue public warnings, mobilize personnel to 
visit vulnerable populations and call hospital and nursing-home staff 
back from their holidays. In 2006, a heatwave similar to the 2003 event 
put the system to the test; although it did reduce mortality and morbid- 
ity, thousands still died, in part because a significant fraction did not 
receive the warning and many did not heed the advice. 

Ebi says that cities can do much better. “Heat-related deaths are com- 
pletely preventable,’ she says, if people are warned and told how to protect 
themselves. Yet the most vulnerable — the elderly, ill and poor, for exam- 
ple — often don't see themselves as being at risk, says Graham Bickler, 
regional director at the Health Protection Agency in Brighton, UK. So 
one challenge is identifying and communicating to those groups. Dur- 
ing a heatwave in Chicago, Illinois, in 1995, Ebi says, many of those who 
died were adults over 65 who lived in poor areas, where it was common 
practice to board up windows to protect against break-ins. The fans they 
used to try to keep cool had the reverse effect and turned their homes 
into convection ovens. They could have reduced their risk by consuming 
cold liquids and foods, taking cool showers and going to air-conditioned 
public spaces, says Ebi. 


EARLY WARNING 

Like Bangladesh, the nations of sub-Saharan Africa are particularly vul- 
nerable to climate change, but they have received relatively little adapta- 
tion funding from international donors. Mozambique stands out as one 
of the most threatened, with 2,700 kilometres of coastline and more than 
half its 24 million inhabitants living in poverty. 

Between 1965 and 1998 the country experienced 12 major floods, 
9 major droughts, and 4 major cyclone events. Located at the end of river 
systems that stretch more than 1,000 kilometres into other countries, 
Mozambique can be struck floods without warning. “Mozambique is 
a country where every year there is a weather-related problem,” says 
Filipe Lucio, a Mozambican meteorologist who now heads the Global 
Framework for Climate Services Office at the World Meteorological 
Organization in Geneva, Switzerland. 

In 2000, Mozambique was hit by a flood worse than any in its history; 
it left 700 dead and caused damages totalling nearly US$300 million, 
a significant fraction of its budget at the time. The event “wasn't at all 
anticipated”, says Lucio. Warnings of above-average rainfall came too 
late and failed to convey the magnitude of the coming flood. Since then, 
the country has switched to a colour-coded system that indicates the lead 
time for a predicted event and has recruited locals to record and monitor 


precipitation and water levels — information they can use to raise alarms. 

The system has succeeded in reducing risks. In 2007, a flood of similar 
magnitude to the 2000 event killed just 29 people. Looking to the future, 
the government has commissioned a study on national climate impacts 
and is working on a long-term strategic adaptation plan. But efforts to 
implement these plans will probably be hindered by lack of funding and 
sluggish bureaucracy’. The state has a budget of just $5 billion, and half 
of Mozambique’s funds come from overseas aid, so resources are scarce 
even for crucial areas such as education and health. As of November 
2011, Mozambique had received $30 million in overseas climate finance 
and a further $86 million has been pledged by the Pilot Program on Cli- 
mate Resilience for a range of issues from upgrading roads to improving 
its meteorological service®. But experts say that this is far from enough. 

It is a picture repeated across the globe. In 2011, developing nations 
received only about $960 million in money dedicated for adaptation- 
related activities, but a 2007 report by the United Nations Development 
Programme estimated that developing countries would need $86 billion 
a year by 2015 in funding to adapt to climate change’. 

The issue is pushing the world’s rich and poor farther apart, says 
Desmund Tutu, the former archbishop of Cape Town in South Africa. 
In the UN report, he warned that “we are drifting into a world of ‘adap- 
tation apartheid”. 

For both wealthy and poor nations, the challenge is to convince peo- 
ple to act before it is too late. “Adaptation is largely a matter of changing 
social processes so that fewer people are at risk,” says Barnett. “This of 
course won't be easy — but it does mean the solutions are determined 
by people, not by nature” = 


Olive Heffernan is a freelance writer and the former editor of Nature 
Climate Change. 
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Around 80% of China’s electricity generation is coal-fired. 


The Kyoto approach 


has failed 


Abandon coal, price carbon consumption and look to new technologies 
for a lasting solution to global emissions, argues Dieter Helm. 


he Kyoto Protocol, agreed in 1997, 

| is the centrepiece of global efforts to 
address climate change by reducing 
greenhouse-gas emissions. Its first commit- 
ment period expires this year, but despite 
the political capital invested in it, numer- 
ous subsequent Conference of the Parties 
(COP) meetings and considerable economic 
costs, it has had no noticeable impact on 
global carbon emissions. These remain on 
an upward curve, increasing from almost 


2 parts per million (p.p.m.) a year in the early 
1990s to almost 3 p.p.m. now, and heading 
towards the critical threshold of 400 p.p.m.. 

It will only get worse. At the Durban COP 
in December 2011, all that could be agreed 
was that the participant countries would try 
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to agree by 2015 what they might do after 
2020. At current growth rates, by 2020 the 
economies of China and India will be twice 
their present size, requiring the addition of 
400-600 gigawatts of coal-fired generating 
capacity to their electricity systems'. 

The reasons for the Kyoto Protocol’s 
ineffectiveness are in its architecture. It 
is based on carbon production, not car- 
bon consumption. It has a mainly Euro- 
pean focus. It does nothing to address 
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> the immediate problem of global coal 
burning. It is wide open to free-riding, 
allowing nations to avoid cutting emissions 
while others do so, and it has few enforce- 
ment mechanisms. These are deep flaws that 
render the protocol incapable of slowing 
emissions, let alone reversing them. Fortu- 
nately, other, better, bottom-up approaches 
hold hope for progress. 


CARBON FOOTPRINT 

The idea at the heart of the Kyoto Protocol 
is that the developed countries accept caps 
on carbon production from power stations, 
industrial installations and the like within 
their borders. Developing countries take 
measures but need not apply caps. On aggre- 
gate, carbon emissions should have been 
reduced by about 5% below 1990 levels by 
the end of 2012. 

The main problem with the Kyoto 
approach is that it does not address the 
carbon footprint — carbon consumption. 
A country’s (and an individual’s) carbon 
footprint is best measured by looking at the 
carbon embedded in the goods and services 
that each consumes. Global warming takes 
no account of national boundaries. If a US 
consumer buys a car, it matters little whether 
the steel within it is made in the United 
States or China. 

The difference between carbon produc- 
tion and carbon consumption is not trivial. 
Take the United Kingdom: from 1990 to 
2005, its carbon production fell by around 
15%. But carbon consumption went up by 
around 19% once the carbon embedded in 
imports is taken into account’. 

From a Kyoto perspective, this is a 
triumph; for climate change, it is a disaster. 
It explains how emissions can apparently fall 
in Europe but go up globally as rapidly devel- 
oping countries, such as China and India, 
export energy-intensive goods to Europe 
and the United States, which together make 
up around 50% of the world’s gross domestic 
product. 

It is not surprising that Europe has led the 
way on Kyoto. Carbon-production targets 
have been comparatively easy to meet, and 
they make Europe look good. But the real 
reasons for the fall in carbon production do 
not give much cause for celebration. 

The collapse of the Soviet Union began 
at the end of the 1980s — nicely timed for 
the use of 1990 as Kyoto’ reference baseline 
year (see ‘Carbon climb). Before this, East- 
ern Europe was notorious for inefficient, 
energy-intensive industrial production, 
much of which — from a Kyoto perspec- 
tive — conveniently stopped after the fall of 
the Berlin Wall. Once the United States had 
opted out of the Kyoto Protocol, the agree- 
ment needed Russia to come on board so 
that it could come into force; it was a condi- 
tion that the protocol should be ratified by 
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at least 55 countries, covering 55% of global 
emissions in 1990. Russia brought lots more 
‘hot air’ — emissions reductions that were 
inevitable. 

Better still from the perspective of Kyoto 
compliance, western Europe was de-indus- 
trializing too, switching away from energy- 
intensive production activities towards 
service industries, in part because Chinese 
exports were outcompeting industries in 
Europe. 

Most of this would have happened any- 
way. Europe’s green policies have made little 
difference, and the economic crisis has made 
reducing its carbon production even easier. 
The 2008 climate-change package from the 
European Union (EU) focuses on the short 
term’*. By 2020, it aims to reduce EU carbon 
emissions by 20%, to increase energy genera- 
tion from renewables by 20% and to boost 
energy efficiency by 20%. Aside from the 
economic illiteracy 


of assuming that “Thereal villain 
everythingaddsup of growing 

tothe magicnum- global emissions 
ber of 20, the effect hasheen 


has been to focus 
almost all resources 
on a small number of current renewable- 
energy technologies — wind, rooftop solar 
and biomass. 

These measures were intended to 
reinforce the EU Emissions Trading Scheme 
(EU ETS)‘, which has produced a short- 
term, volatile and low price for carbon when 
what is required is a medium- to long-term, 
stable but rising carbon price. The net effect 
of all these EU measures (especially the 
renewables) has been to drive up energy 
prices and reduce European competitive- 
ness, while making almost no contribution 
towards mitigating global emissions. 


ignored: coal.” 


THE COAL PROBLEM 

The real villain of growing global emissions 
has been ignored: coal. Since the mid- 1990s, 
coal has risen from supplying around 25% 
of the world’s primary energy to almost 
30% now, in a context of a rapidly growing 
underlying energy demand’. 

Much of this coal burning has been 
in China, which switched from being an 
exporter to an importer of coal in the 1990s, 
and now accounts for a staggering 50% of 
world coal trade. Its share of global coal pro- 
duction is almost four times that of Saudi 
Arabia’s production of oil®. Around 80% of 
China’s electricity generation is coal-fired. 
China and India together add around three 
coal-fired power stations a week to their gen- 
eration portfolios. 

China plans to improve its energy effi- 
ciency, reducing its energy intensity under 
its 12th Five Year Plan by 16% between 
now and 2015. It intends to develop gas 
and renewables. But the arithmetic of an 
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economy that doubles in size every decade 
puts these changes into perspective: a slightly 
smaller proportion of coal-fired electricity 
generation in an economy twice the size by 
2020 becomes lost in the noise. The world 
faces a further huge increase in coal burning 
between now and 2020, and the result will 
be ever-rising emissions. The Kyoto Protocol 
has almost nothing to say about this. 

The story on coal elsewhere is mixed. In 
the United States — which remains outside 
the Kyoto Protocol — carbon emissions 
have been falling sharply’, and faster than 
in crisis-ridden Europe over the past five 
years. The United States is shifting from coal 
to natural gas for electricity generation and 
industry as the full impact of the shale-gas 
revolution is played out. Natural gas pro- 
duces fewer pollutants and half the carbon 
emissions of coal. 

With the price of natural gas in the United 
States currently around one-quarter of that 
in Europe, and even lower than that in 
China, economics is driving a shift towards 
natural gas without the need for any drastic 
energy or climate-change policies. So great 
is the competitive advantage bestowed by 
shale gas that energy-intensive industries are 
beginning to migrate from China back to the 
United States. Ironically, these repatriated 
industries will push up US carbon produc- 
tion while reducing China’. The net effect is 
good from a global climate-change perspec- 
tive — less transport to the United States and 
more-efficient electricity generation — but it 
would make the country look bad by Kyotos 
emissions-cap approach. 

In Europe, the irony is deeper still — many 
countries are switching from nuclear and gas 
to coal. Germany stands out. It has prema- 
turely closed some of its existing nuclear- 
power stations, and is fast-tracking the 
closure of the rest within the next decade. It 
is getting out of low-carbon generation in a 
big way. Germany is increasing the burn in 
its existing coal power stations, and building 
new ones that burn lignite, one of the dirti- 
est forms of coal. The use of natural gas is 
being squeezed out by low coal prices across 
Europe, and because the EU ETS carbon 
price is so lowas to be negligible. 


FREE-RIDING 
Advocates of the Kyoto approach argue that 
these problems are temporary. Over time, 
other countries will join, eventually result- 
ing in a complete set of carbon-production 
caps. Then the distinction between carbon 
production and carbon consumption will 
not matter. Putting aside the facts — that 
nothing much is going to happen until at 
least 2020, and that we don't have the luxury 
of waiting as emissions pile up — why should 
we have any confidence in the gradual evolu- 
tion of Kyoto? 

The core architecture of the protocol relies 
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CARBON CLIMB 


Global atmospheric carbon dioxide concentrations have risen steadily since the Kyoto Protocol was signed. 


Reference 
baseline year 


Kyoto Protocol 
agreed 


Atmospheric carbon dioxide (p.p.m.) 


325 
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on countries adopting emissions caps and 
complying with them in the face of major 
free-rider incentives, differential impacts 
and no serious enforcement mechanisms. 
Kyoto is similar to the classic prisoners’ 
dilemma in game theory. Each country may 
be better off if all nations cooperate, but each 
may be tempted to free-ride on the costs and 
efforts of others. 

For instance, emissions from some coun- 
tries are capped, but those from others are 
not. The result is that the implicit (or explicit) 
price of carbon in countries with a cap cre- 
ates a trade distortion in favour of those 
uncapped countries that do not have an 
effective carbon price. Put bluntly, not pric- 
ing carbon properly on a comparative basis 
in China is equivalent to an export subsidy’. 

The problems for a top-down approach 
such as Kyoto are made worse by the fact 
that the impacts of climate change are not 
all bad, and vary considerably. Arctic coun- 
tries, such as Russia and Canada, have much 
to gain from the resources that will become 
accessible once the ice has melted. For tem- 
perate zones, initial rises in temperature 
might mean that less energy will be needed 
for winter heating and, in some areas, could 
result in higher agricultural productivity. 

Warming of the planet by more than 2°C 
will change this cost-benefit equation, but if 
costs and benefits differ, getting a top-down 
agreement is made all the harder. For Rus- 
sia and Canada, for example, the situation 
is very different from that in poorer tropical 
nations. No wonder both these Arctic coun- 
tries were effectively in the ‘reluctant camp’ 
at the Durban COP, joining the United States 
on the outside. 

Fortunately, there is a better way forward. 
Instead of taking a top-down approach that 
requires global agreement, climate-change 
policies can be constructed from the bottom 
up using three key building blocks: putting a 
tax on carbon consumption; switching from 
coal to gas as quickly as possible; and boost- 
ing spending on new energy technologies. 


2000 


European Union 
climate-change 
package 


World 
economic crisis 


Durban 
meeting 


Europe starts 
emissions trading 


2005 2010 


The first building block is to recognize that 
carbon consumption is more important 
than carbon production, and so the carbon 
price should be based on consumption. This 
means pricing the consumed carbon irre- 
spective of where it was produced. 


TAXING CARBON TRADE 
If the exporting and importing countries 
both have domestic carbon prices, then 


it does not matter what the base is. But if 


an exporter of carbon-intensive goods — 
such as China — does not price carbon at 
an appropriate level, there needs to be a 
border tax adjustment on imports. 
Although there are lots of practical issues, 
only a small number of carbon-intensive 
industries make up the bulk of carbon 
trade, so in practice, a targeted set of border 
taxes should do the job. 

The neat consequence of this approach 
is that countries can take their own meas- 
ures to address carbon without reducing 
their competitiveness, and it encourages the 
exporter to introduce its own carbon price 
to avoid the money going to the importer’s 
government. 

Proper carbon pricing also helps to 
encourage the second building block — the 
coal-to-gas switch. In the short term, this 
switch is perhaps the only way of slowing 
down the wall of coal that will come into 
world energy systems in the next decade. 

Pricing carbon and getting out of coal pro- 
vide the only serious prospects for having 
some effect on emissions in the short term. 
Further ahead, what matters is technology. 
None of the existing technologies is well 
placed to crack the climate-change problem 


— especially not the current generation of 


wind turbines and solar panels. Biomass and 
energy crops have similar constraints: there 
is not enough land, water or shallow sea to 


yield sufficient energy to meet the needs of 


a global population that is predicted to rise 
to nine billion by 2050 (ref. 9). 
The good news is that there are many new 


technologies that might help to crack the 
problem — from next-generation solar to 
geothermal and even new nuclear technolo- 
gies. This is the third and most important 
building block. 

Although politicians have put all their 
efforts into Kyoto, and Europe has over- 
whelmingly invested in current renewables 
partly asa result, the prize has been ignored. 
It would be better if the focus shifted to what 
really matters — pricing carbon consump- 
tion, getting out of coal as fast as possible 
and investing heavily in future renewables 
and new energy technologies. 

Little, if any, of this will be achieved 
through the Kyoto approach. At each COP, 
the now well-established process of green 
groups and environmental ministers gath- 
ering together to make declarations of intent, 
without much concrete action, is played out. 
Copenhagen in 2009 was supposed to dem- 
onstrate Europe's world leadership in craft- 
ing a new climate agreement. In fact, the 
United States and China agreed the weak 
Copenhagen Accord outside the conference 
framework, and without Europe. At Durban, 
even the prospect of serious action was put 
off until after 2020. 

There is merit in world leaders meeting, 
not least because it puts climate change in 
the media spotlight. Yet the Kyoto confer- 
ences allow participants to be seen to be 
taking climate change seriously while actu- 
ally doing very little. Climate-change policy 
cannot wait another decade. The net results 
of Copenhagen and Durban have been to 
make Kyoto largely irrelevant. It is prob- 
ably worth going on with the talking, but 
the main chance lies in getting on with the 
bottom-up approach. = 
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CARBON TRADERS 


Cities, regions and emerging economies are following Europe in setting up carbon-emissions trading schemes. The 
amount of carbon capped (filled circles) is shown to the same scale as national emissions volumes (blue circles). 
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Cap and trade 
finds new energy 


Anemerging coalition is implementing carbon trading 
despite political obstacles. It is rewriting the map of 
climate diplomacy, says Michael Grubb. 


presented a blueprint for a way to curb 

the increase of carbon dioxide in the 
atmosphere. One key measure, put forward 
by the administration of Bill Clinton, the US 
president at the time, was an enticing exten- 
sion of free-market logic: establish emis- 
sions quotas to limit pollution, and trade 
them internationally. Let the market find the 
lowest-cost way to deliver the goal. 

Cap and trade is one of two ways to put a 
price on pollution — taxation is the other. 
And emissions trading had been enacted 
successfully through the US Clean Air Act 
of 1990 to limit sulphur dioxide, a cause of 
acid rain. 

The United States rammed a global 
cap-and-trade model into the Kyoto 
Protocol, allowing industrialized countries 
to meet their commitments with maximum 
flexibility, such as by buying emissions cred- 
its from projects in developing countries. 
The model was crucial in agreeing national 
emissions caps for Kyoto’ first ‘commitment 
period, from 2008-12. 

That time has proved long enough for 
the grand plan of a globally negotiated, 
unified cap-and-trade system to unravel. 
The United States never ratified the pro- 
tocol. Europe's flagship Emissions Trading 
Scheme (ETS) is in trouble. Yet, something 


Bee« years ago, the Kyoto Protocol 
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unexpected is happening. Smaller carbon- 
pricing schemes are springing up across the 
world (see ‘Carbon traders’). This renewed 
momentum could yet deliver an effective 
response. 


ROCKY ROAD 

Even though the idea of cap and trade 
originated in the United States, it fell to the 
European Union (EU) to implement it for 
CO, after US president George W. Bush 
repudiated the Kyoto Protocol in 2001. The 
EU’s rapid establishment of a carbon cap and 
price covering CO, emissions from power 
generation and industry across 27 countries 
(plus some neighbours) was an astonishing 
achievement. 

The EU ETS evolved in phases. Its first 
three years (2005-07) ended with the carbon 
price crashing to zero as industry — fearing a 
shortage of emissions allowances — instead 
achieved a surplus by over-complying with 
emissions cutbacks. 

The second phase coincided with Kyoto’s 
first commitment period. The European 
Commission won a historic legal and politi- 
cal battle to force EU member states to set 
emissions caps aligned with their national 
Kyoto commitments. To avoid another 
pricing roller coaster, the rules allowed for 
surpluses to be ‘banked’ for later use. In its 
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first four years, the ETS is estimated to have 
cut CO, emissions by 40 million to 80 mil- 
lion tonnes a year on average, or 2-4% of the 
total capped’. 

Flush with apparent success, the EU 
advocated a global carbon market of similar 
schemes operating in the member coun- 
tries of the Organisation for Economic 
Co-operation and Development (OECD) 
by 2015 and expanding to include all 
major emitters by 2020. With the US presi- 
dency of Barack Obama, who espoused 
cap and trade in his 2008 campaign, it 
seemed that such a vision was back on 
track. The US Congress began work on the 
Waxman-Markey energy bill, which had 
emissions trading as its centrepiece. 

But events intervened and appetites for 
carbon trading waned. The credit crunch 
brought complex trading instruments into 
disrepute. The EU ETS saw scandals such as 
the theft of allowances from registries and 
fraud associated with complex tax treat- 
ments across borders. 

In July 2010, the Waxman-Markey 
bill crashed in the Senate. The consensus 
became that the United States would not 
stomach carbon pricing — the public would 
never accept taxation, and the Senate had 
rejected the only alternative. Meanwhile, 
it became clear that the EU would over- 
achieve its second-phase Kyoto targets, 
owing to recession and progress on energy 
efficiency and renewable energy. 

To give industry more time to plan, the 
third phase of the EU ETS was extended 
to 2020. This turned out to be a poisoned 
chalice. The surplus of emissions permits 
from phase two is now so big that it covers 
all of the cutbacks agreed up to 2020 — ren- 
dering the phase-three cap almost irrelevant. 

The ETS carbon price 


“Without has correspondingly 
changes, the slumped to around 
future of the €7 (US$9) per tonne, 
EUETSin less than one-third of 
driving that projected before 
emissions Europe’ recession. 

reductions This low price can- 


not incentivize invest- 
ment in low-carbon 
technology, and it has devastated revenues 
that were expected to fund innovations such 
as carbon capture and storage. Instead, old 
plans for new and upgraded coal-fired power 
plants have been dusted off, predicated on 
the belief that the ETS surplus will sustain 
low carbon prices through 2020. Without 
changes, the future of the EU ETS — and its 
role in driving emissions reductions — looks 
bleak. 

Many pundits rushed to declare emissions 
trading dead. Academics bashed targets and 
caps as a ‘top-down political process out of 
sync with the real world. The intelligentsia 
offered an alternative ‘bottom-up vision of 


looks bleak. ” 


SOURCE: A. PRAG, G. BRINER & C. HOOD MAKING MARKETS: UNPACKING DESIGN 
AND GOVERNANCE OF CARBON MARKET MECHANISMS (OECD/IEA, 2012). 


localized mitigation efforts that would not 
rely on emissions caps, pricing or interna- 
tional negotiations’. 

These contributions underlined the 
importance of local politics and gave 
renewed attention to energy efficiency and 
innovation. But the alternative vision was 
also out of touch with reality, because it could 
not answer three fundamental questions. 

First, major investments are made on the 
basis of economic returns. If there is no car- 
bon price, why should investors decarbon- 
ize? The alternative is regulation directed 
against the grain of price signals — hardly 
appealing to conservative critics. Second, 
emissions targets set goals that can direct 
effort. Why would not specifying a target be 
more likely to spur action? Third, innovation 
requires years of nurturing, development, 
commercialization and growth, based on the 
prospect of profitable markets. Why should 
low-carbon technologies emerge faster with- 
out carbon goals and prices to reward and 
help to finance such innovation? 

Some advocates of the new vision cite the 
decline in US emissions, despite the lack 
of a carbon price, as proof of concept’. But 
it does not follow that carbon trading and 
targets are irrelevant. High global oil and 
coal prices, economic slowdown, stronger 
energy- efficiency programmes and renewa- 
ble energy policies have contributed to lower 
US emissions, as has an abundance of cheap 
shale gas. Both price and non-price factors 
are important. Cheap gas will reduce emis- 
sions if it displaces coal but not ifit displaces 
renewable energy sources. The CO, savings 
achieved in recent years could be reversed 
if there is not a carbon price sufficient to 
ensure that gas remains cheaper than coal 
for generating power. 


THE NEW RULES 

In practice, events are challenging the ‘top- 
down and ‘bottom-up’ caricatures as well as 
the assumed leadership role of developed- 
country governments. With stalled national 
progress and a mix of concerns about cli- 
mate change, energy security and stimulat- 
ing investment and innovation, some states, 
cities and emerging economies are forging 
ahead with local carbon taxation and trad- 
ing policies’, 

Asian economies have rushed to fill the 
void. In 2009, South Korea adopted the most 
environmentally oriented stimulus package 
ofany country, and legislation for emissions 
trading passed its parliament in May 2012. 
Beginning in 2015, the scheme will set caps 
for facilities responsible for 60% of the coun- 
try’s emissions’. 


AFTER KYOTO 


The legacy of a climate treaty 
nature.com/kyoto 


China’s National Development and 
Reform Commission in July 2010 launched 
pilot ‘low-carbon development zones’ in 
five provinces and eight cities. Under the 
country’s five-year plan for 2011-15, five 
cities (Beijing, Tianjin, Shanghai, Chong- 
qing and Shenzhen) and two provinces 
(Guangdong and Hubei) will establish pilot 


emissions trading schemes. Adoption of 


a national cap-and-trade scheme by 2015 


seems implausibly fast, but with the pace of 


Chinese developments, who knows. 
India has developed 


the ‘Perform Achieve “Smaller 

Trade’ system toimple- carbon- 
mentenergy-efficiency pricing 

goals in three phases schemes could 
during 2012-20, for yet deliver 
essentially the same an effective 


sectors covered by the 
EU ETS. 

In wealthy countries, several regional gov- 
ernments have pushed ahead. This month 
California held the first auctions for its 
CO, emissions trading system, with which 
Quebec is also affiliated. In 2008, British 
Columbia introduced the first major carbon 
tax since the Scandinavian countries did 
so 20 years earlier, with the price rising to 
Can$30 (US$30) per tonne in 2012. In 2011, 
Tokyo became the first city to adopt a munic- 
ipal cap-and-trade system for carbon — an 


response.” 


example emulated by Rio de Janeiro ahead of 


Rio+20, the 2012 United Nations Conference 
on Sustainable Development in Brazil’. 

Australia has clung to its domestic 
scheme, which came into force this summer 
as a three-year carbon tax that in 2015 will 
morph into a trading scheme linked with 
the EU ETS. The Australian government 
announced on 9 November that it intends to 
join the EU in signing on for a second period 
of the Kyoto Protocol. 

By next year, about 10% of global emis- 
sions will be covered by a carbon price; by 
2015, the Korean and Chinese pilot schemes 
will take this closer to 15% — and more plans 
are emerging. The effort will look nothing 
like the Kyoto vision or the EU’s idea of an 
OECD-wide market. But it will be the first 
big step for an emergent ‘coalition of the will- 
ing’ that recognizes carbon pricing as integral 
to any credible strategy for sustainable and 
innovative low-carbon economic growth. 

The dichotomy between “bottom up’ and 
‘top down is false — it is like arguing over 
“whether a supertanker needs an engine or 
a captain’, as Christiana Figueres, executive 
secretary of the United Nations Framework 
Convention on Climate Change, has put 
it (go.nature.com/mievx3). The ‘engine’ 
of emissions reductions is inevitably bot- 
tom up, and it must combine efforts on 
efficiency, pricing and innovation. But a 
top-down strategy, with national goals and 
international negotiations, is also essential 


to orient these efforts, address common 
problems and provide the international 
assistance required for global progress. 

The Kyoto Protocol’s grand plan is dead 
— so too is its antithesis. Carbon pricing is 
proliferating. Last year’s United Nations con- 
ference in Durban, South Africa, launched 
negotiations for a global deal to be reached 
in 2015; this could reinforce, and help to link 
and orient, these emerging efforts. 


NEXT STEPS 

Three things need to happen. First, the United 
States must rejoin the global effort. Since Hur- 
ricane Sandy, New York City mayor Michael 
Bloomberg and New Jersey governor Chris 
Christie have opened the political window for 
renewed climate-change debate. The Obama 
administration needs to reinforce the message 
that the confluence of climate science and 
market economics demands a carbon price, 
whether by tax or trade. 

Second, the EU needs a coherent suite of 
policies for efficiency, pricing and innova- 
tion. The ETS is a means to pricing and a 
route to financing efficiency and innovation, 
but it is not the whole game. The most urgent 
problem is to shift investment from new 
coal plants to low-carbon sources. There are 
grounds for removing the accumulated sur- 
plus of ETS emissions allowances; placing a 
minimum price on future auctions of allow- 
ances could deter coal plants and make the 
system more robust against future shocks*. 

Third, many developing countries in the 
United Nations take a negative negotiat- 
ing position that is out of touch with their 
domestic progress and strategic interests. 
This is not a zero-sum game, and haggling 
over ‘sharing the burden is no solution. The 
emerging economies need to drop the blame 
game and use international negotiations to 
support their domestic efforts, so that clean, 
secure and sustainable energy overtakes 
carbon-intensive development worldwide 
as soon as possible. With these steps and 
initiatives elsewhere, a powerful coalition 
for effective emissions reductions could yet 
emerge and cement its progress in the global 
deal targeted for 2015. m 
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Skyscrapers such as New York’s Citigroup Center must contend with complex wind dynamics. 


Turbulent genius 


Allan McRobie enjoys a life of the audacious engineer who 
pioneered the windproofing of bridges and skyscrapers. 


unlikely gem, a biography of both a 
man anda field. It tells the story of Alan 
Davenport and the 50 years he spent creating 
the discipline of wind engineering to span 
the gaps between fluid dynamics, meteorol- 
ogy, structural engineering and architecture. 
The book reveals the backstory to many 


S= Roberts’ Wind Wizard is an 
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of the world’s more iconic structures. Here, 
for instance, are the World Trade Center; the 
Sears, CN and John Hancock Towers; the 
Citicorp Center; and a comparable compen- 
dium of epic bridges, all from the perspec- 
tive of their ability to withstand windstorms. 
Early wind-tunnel tests of the World Trade 
Center revealed the need for more realistic 
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wind modelling, spur- 
ring Davenport to 
establish the Bound- 
ary Layer Wind Tun- 
nel at the University 
of Western Ontario in 
London, Canada. This 
facility was designed 
to replicate the turbu- 
lent conditions of the 
lower atmosphere. 

Such projects are 
massive investments, 
often of billions of dol- 
lars, with thousands 
of lives at risk when 
extreme winds hit. The 
physics is complex and uncertain, the math- 
ematics intractable and the definitive experi- 
ment — building the full-scale structure and 
seeing what happens — cannot be done. It is 
a difficult problem, and the book describes 
how Davenport pieced together pragmatic 
theory and painstaking model testing to give 
rational, reliable predictions of performance. 

Roberts charts how each challenge led to 
improvements in procedure and theory. For 
example, in her descriptions of the young 
Davenport's meetings with Leslie Robertson, 
the World Trade Center's structural engineer, 
as early as 1964, you detect both the creation 
of a prudent yet record-breaking design and 
the emergence ofa field. Davenport replaced 
rudimentary rules of thumb for static pres- 
sures with a discipline. This tackled the com- 
plexities and uncertainties of the atmospheric 
boundary layer and the dynamic complica- 
tions of wind flows such as galloping, vor- 
tex shedding, buffeting and wake buffeting. 
It was from that design process — which 
inevitably makes for poignant reading given 
the events of 11 September 2001 — that the 
Boundary Layer Wind Tunnel emerged. It 
went on to become central to all such studies. 

Two of the projects studied in the tun- 
nel show the potential for disaster posed by 
skyscrapers. In 1978, a phone call from an 
inquisitive student to the structural engi- 
neering firm behind the 59-storey Citi- 
corp Center (now the Citigroup Center) 
— already built, and balancing on four huge 
columns high above mid-town Manhattan 
— prompted the shocking realization that 
the supporting calculations had omitted to 
take into account ‘quartering’ winds, which 
hit the building at 45 degrees. 

This story, well-known in structural- 
engineering circles, represents one of the 
nightmare scenarios. Roberts captures the 
heart-thumping horror of the moment, and 
the parts played by Davenport and Robert- 
son in the testing and 
emergency remedial 
action to strengthen 
the bracing that fol- 
lowed. Perhaps my 


Wind Wizard: Alan 
G. Davenport and 
the Art of Wind 
Engineering 


Princeton University 
Press: 2012. 288 pp. 
$29.95, £19.95 


For Nature's city 
special, see: 


only criticism of the book is that the 
student who telephoned is not named. I 
believe her to be Diane Hartley, who was 
then studying under David Billington 
at Princeton University — a surprising 
omission for that institution’s own press. 

The other difficult case is the John 
Hancock Tower in Boston. Its problems 
were more subtle, although equally 
alarming. Again, the issue involved wind 
forces from directions that had not been 
considered, and required the retrofit- 
ting of stiffening and dampers, at great 
expense, to make the structure safe. From 
now on, I shall refer students and profes- 
sors alike to Roberts’ clear account. 

I did begin to wonder whether the ulti- 
mate outcome of Davenport’s life-long 
effort was allowing financiers to inhabit 
lofty eyries without overly endanger- 
ing the people below. But the last chap- 
ter focuses on his determined efforts at 
disaster mitigation for the vulnerable. 
For example, in the Caribbean, he has 
worked on hurricane-resistant houses 
and was involved in numerous interna- 
tional initiatives that worked on disaster 
mitigation at a human scale. 

Roberts has written a largely equation- 
free book in which technical subtleties 
such as aeroelasticity and Davenport's 
statistical description of turbulent buf- 
feting are set out clearly, engagingly and 
accurately. Her precise, vivid phrases, 
such as vortices “pushing and shoving the 
structure this way and that like a gang of 
bullies’, will enliven my future lectures. 

Before opening 
Two of the the book, I had 
projects decided to look 
studied show out for two poten- 
the potential tial pitfalls. First, 
for disaster would the book 
posed by acknowledge the 


skyscrapers. alternative to Dav- 
enport’s statistical 


theory of buffeting — the rapid distor- 
tion theory developed by Julian Hunt? 
It does. Second, would the story of the 
famous 1940 Tacoma Narrows Bridge 
collapse in Washington state fall back 
on the lazy and inaccurate ‘resonance’ 
description that most physics textbooks 
adopt? It does not. Instead, Roberts gives 
faultless coverage of work by engineers 
Robert Scanlan and, more recently, Allan 
Larsen to explain the physics of what 
actually happened. 

This is my field, but I learned 
much from Roberts’ admirable book, 
and emerged with great respect for both 
Davenport and his chronicler. m 


Allan McRobie is a Reader in Structural 
Engineering at Cambridge University, UK. 
e-mail: fam@eng.cam.ac.uk 


Books in brief 


Islands: From Atlantis to Zanzibar 

Steven Roger Fischer REAKTION BOOKS 352 pp. £22 (2012) 

From Charles Darwin’s moment in the Galapagos to the cultural 
efflorescence of Minoan Crete, islands are “crucibles and cradles” — 
laboratories, havens, touchstones. In this tour of their biology, geology 
and culture, linguist Steven Roger Fischer offers a taste of the million 
or so mini-biomes studding Earth’s rivers, lakes and oceans. He is 

a brilliant guide, whether discussing the anti-cancer properties of 
the Madagascar periwinkle, Papua New Guinea’s 500 languages, 
the imagined isle where Shakespeare’s Prospero abjured his ‘rough 
magic’, or the very real threat climate change poses to many islands. 


Drinking Water: A History 
James Salzman OVERLOOK 320 pp. $27.95 (2012) 
Potable water permeates humanity’s past and is set to dominate its 
future. The United Nations estimates that by 2030, more than half 
of us will live in water-scarce areas. Environmental-policy specialist 
\ James Salzman goes with the flow in this absorbing chronicle of our 
S complex relationship with H20. He negotiates multiple currents: the 
1AMee ‘cures’ ascribed to sacred waters; the ongoing struggle to eradicate 
* microbes and dicey chemical compounds; urban waterworks and 
? politicized availability; scarcity and bulk water transfers; and today’s 
search for water — a quest awash with uncertainty. 


The Learning Brain: Memory and Brain Development in Children 
Torkel Klingberg OXFORD UNIV. PRESS 200 pp. $24.95 (2012) 

In this pragmatic treatise on how children learn, neuroscientist 
Torkel Klingberg homes in on working memory. Klingberg has 
marshalled swathes of research and pertinent case studies to show 
how gaps in this form of memory can lead to educational failure, 
and how training the young in tested techniques can help. Enriching 
his argument with findings — from the role of white matter to the 
corrosive effects of stress — he concludes that key pedagogic tools 
include ‘memory training’ to boost cognitive function, aerobic 
fitness, reduced anxiety and regular sleep. 


Watching Vesuvius: A History of Science and Culture in Early 
Modern Italy 

Sean Cocco UNIV. CHICAGO PRESS 336 pp. $45 (2012) 

Historian Sean Cocco looks anew at Vesuvius to reveal how early 
responses to it shaped modern volcanology. Now monitored 
closely — as befits a looming risk to at least a million people 

— in Renaissance and Baroque Naples the volcano was just 
becoming a focal point for scientific appreciation. Cocco argues 
that a combination of the city’s cultural traditions and the chain of 
eruptions that kicked off in 1631 helped to avert the early modern 
scientific eye from sky-gazing to the earthly wonders of geology. 


Walking Sideways: The Remarkable World of Crabs 


memes fl Judith S. Weis COMSTOCK PUBLISHING ASSOCIATES 256 pp. $29.95 (2012) 
Well Stalked eyes, formidable claws, sidling gait: crabs are found around 
y the globe and in environments ranging from deep-sea vents to 


bromeliad plants growing in trees. Biologist Judith Weis explores 

this crustacean cosmos with verve, touching on evolution, species, 
the habitats, anatomy and functions, behaviour, ecology and fisheries. 
wee at From the spotted orange Japanese spider crab (whose leg span can 
measure more than 3.5 metres) to the shell-swapping hermit crabs 
of Belize, this is a gripping overview of a remarkable family. 
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Dreaming of the bomb 


Istvan Hargittai explores a life and work of Manhattan 
Project leader, physicist J. Robert Oppenheimer. 


towering yet enigmatic figure among 
Anes! physicists, J. Robert 

Oppenheimer directed the US lab- 
oratory in Los Alamos, New Mexico, that, 
between 1943 and 1945, built the first atomic 
bombs. He earned the label ‘father of the 
atomic bomb and worldwide fame, and fea- 
tures in numerous books. In the latest, Inside 
the Centre, Ray Monk — biographer of Ber- 
trand Russell and Ludwig Wittgenstein — 
brings a philosopher's nuanced perception 
to Oppenheimer’s life and work. 

Oppenheimer grew up in a privileged 
upper-west-side Manhattan family, but 
felt burdened by being Jewish and “tried to 
pretend that he wasn’t’? in the words of his 
friend, the Nobel-prizewinning physicist Isi- 
dor Rabi. A lonely childhood was followed 
bya troubled youth; he even showed signs of 
destructive tendencies. Oppenheimer was 
trying, as he would all his life, to discover an 
identity and an avocation. 

Oppenheimer followed the customary path 
of budding US scientists of the time, com- 
pleting his education in Europe. In 1925, he 
joined Ernest Rutherford’s Cavendish Labora- 
tory in Cambridge, UK, where he was men- 
tored by future Nobel prizewinner Patrick 
Blackett. Rumours persist ofa bizarre incident 
in which Oppenheimer left an apple laced 
with a chemical — believed to be cyanide — 
on Blackett’s desk. In any case, Oppenheimer 
was unhappy: he had little aptitude for experi- 
mental physics. Moving to Max Born’s lab in 
Géttingen, Germany, a hotspot of theoretical 
physics, he became a top player. 

In 1929, Oppenheimer returned to the 
United States for good. He worked at the 
California Institute of Technology in Pasa- 
dena and the University of California, 
Berkeley, building up an American school 
of theoretical physics. Soon, an influx of bril- 
liant scientists fleeing the Nazi takeover in 
Europe arrived to bolster his efforts. Among 
the glowing successes were contributions to 
what later became known as the black-hole 
concept and astrophysics. By the time the 
field could contribute to the war effort, he 
and his colleagues were ready. 

For a long time, the well-to-do Oppen- 
heimer was oblivious to the economic dif- 
ficulties around him and had little interest 
in world affairs. His political awakening in 
the mid-1930s occurred as a consequence of 
the hardship he observed during the Great 
Depression and the intensifying persecution 
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of Jews in Germany. He was drawn to the 
Communist Party, although he always 
denied having been a card-carrying member. 
When nuclear fission was discovered in 
Germany in 1938, the Manhattan Project 
was initiated to develop an atomic weapon. 
Its final phase was bomb production — 
for which the Los Alamos Laboratory was 
created in 1943. This powerhouse drew in 
other Manhattan Project resources: brain- 
power from the Metallurgical Laboratory 
in Chicago; uranium-235 from Oak Ridge, 
Tennessee; and plutonium from Hanford, 
Washington. Oppenheimer, however, 
seemed an odd choice as leader, having never 
directed anything. What no one foresaw was 
his remarkable ability to inspire associates. 
Oppenheimer never regretted his role in 
making the bombs. He saw their deployment 
against Japan as helping to end the Second 
World War quickly, saving millions of lives, 
despite having killed some 150,000 Japa- 
nese in Hiroshima and Nagasaki. In 1947, 
he declared that “physicists have known sin’. 


Robert Oppenheimer in 1958. 


Later, he clarified that he meant the sin of 
taking pride in their achievements rather 
than the sin of having caused destruction. 
Once involved with the Manhattan Project, 
Oppenheimer gradually dissociated himself 
from communism. However, even while 
directing Los Alamos, he was constantly 
being investigated by US security organs 
over his communist activities and connec- 
tions. In his eagerness to demonstrate loyalty 
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to his country, Monk 
reveals, Oppenheimer 
lied despicably about 
friends and former 
pupils. For example, 
he unjustly accused his 
gifted former student, 
Bernard Peters, who 
had participated in 
anti-Nazi street-fights 
in Germany, of being a 
dangerous Red. 
After the war, 
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. ‘ RAY MONK 
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seen as a hero scientist. 
He chaired several committees, including the 
General Advisory Committee of the Atomic 
Energy Commission (AEC), which some- 
times caused conflict of interest. For example, 
the Pentagon gave up the idea of the hydro- 
gen bomb after Oppenheimer told them it 
was technically unfeasible. He then told the 
AEC that the Pentagon wasn't interested in 
developing the bomb. Spreading himself too 
thin also impaired his judgement: he humili- 
ated others, made powerful enemies and hurt 
his chances of maintaining a leading role in 
government affairs, which he craved. 
During the McCarthy era between 1950 
and 1954, Oppenheimer’s leftist past caught 
up with him. His concocted stories surfaced, 
and his only explanation was: “I was an idiot? 
Monk's presentation of the well-known story 
of the ‘Oppenheimer hearing’ before an AEC 
security panel is a highlight of the book. 
Oppenheimer had the highest level of secu- 
rity clearance because of his sensitive position. 
By the time his clearance was about to expire, 
his loyalty and trustworthiness had been 
questioned by a number of people. The AEC 
set up a personal security board to decide on 
an extension and, in 1954, many scientists tes- 
tified before it. The damaging testimony of 
nuclear physicist Edward Teller is often held 
responsible for Oppenheimer’s downfall. The 
most relentless advocate for a US hydrogen 
bomb, Teller viewed Oppenheimer as an 
obstacle to his efforts. But the ‘prosecution’ 
had already destroyed Oppenheimer’s verac- 
ity by the time Teller stepped into the witness 
stand. Teller’s testimony ultimately harmed 
him more than it did Oppenheimer. 
Oppenheimer was both a brilliant physicist 
and a poor politician; a sophisticated speaker 
and an inconsistent debater; an inspirational 
colleague and a disloyal friend. In this highly 
readable book, Monk makes great strides 
towards fully understanding the phenom- 
enon that was J. Robert Oppenheimer. m 


Istvan Hargittai is professor emeritus at 
Budapest University of Technology and 
Economics and author of Judging Edward 
Teller. 

e-mail: istvan.hargittai@gmail.com 
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THANATOLOGY 


Beyond the 


sports cars, dolls or door knobs. Pretty 

much anything that can be culled, 
catalogued and curated is being amassed by 
someone, somewhere. 

Richard Harris collects death. More than a 
decade ago, the former antique-print dealer 
and one-time anatomy student came across 
aseries of memento mori prints, such as a late 
eighteenth-century engraving that depicts 
a half-man, half-skeleton digging his own 
grave. That print, along with several hundred 
other works that Harris has since acquired, is 
now on show at Death, a marvellous exhibi- 
tion at London's Wellcome Collection. 

“It's a universal subject,” Harris said, as 
gallery workers put the finishing touches to 
the five-room, 300-piece exhibit. “We're all 
going to die.” 

The vast Wellcome Collection — compris- 
ing medical devices, texts and miscellanea 
— is an appropriate setting for Harris’ col- 
lection, which shattered attendance records 
when it was shown at the Chicago Cultural 
Center in Illinois earlier this year. But it is 
safe to say that Henry Wellcome, the million- 

aire who amassed the 
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grave 


Death’s multifarious faces in two London exhibitions 
exhilarate Ewen Callaway. 


of Others (2009). One of the work’s three 
chandeliers, which are made of hundreds of 
plaster-cast bones, illuminates the entrance. 

Bones abound at Death. Barthel Bruyn the 
Elder's sixteenth-century Portrait ofa Man/A 
Skull in a Niche is a two-sided painting: 
nobleman on one side, a skull on the other. 
One wonders whether its original owner 
oriented it according to his health and mood. 
Contrast that work with the Argentine col- 
lective Mondongos The Skull Series (2009) 
—a 2-by-2-metre skull made of miniature 
plastic books, buildings and a rubber duck, 
set against a backdrop that depicts Pacman 
video-game screens. ‘Conversation piece’ 
doesn't do it justice. Harris’s own favourite 
is also the smallest: June Leaf’s Gentleman on 
Green Table (1999-2000), a hunched skeletal 
form made of rusted tin, wire and screws. It 
is more evocative than any pile of calcium 
carbonate I’ve seen or held. 

Yet there is a sense of osseous overload. 
By the time you get to Marcos Raya’s imagi- 
native series of Mexican portraits, inspired 
by Day of the Dead folk art, with skeletons 
superimposed on each family member, you 
have seen bones forged from brass, papier 
maché and laser-cut paper. 

The most powerful pieces tackle death 
directly. In a series of 51 etchings, the Ger- 
man expressionist Otto Dix depicts his time 
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as a First World War artillery gunner. Storm 
Trooper Advancing Under Gas (1924) shows 
ghoulish gas-mask-clad soldiers emerging 
from a trench. A century earlier, Francisco 
Goya captured the horrors of the Peninsula 
War between France and Spain in a series 
of haunting etchings, The Disasters of War 
(1810-20). Corpses, a severed head and 
limbs dangle casually from a tree in An 
heroic defeat! With dead men!. These etch- 
ings, along with those of Jacques Callot from 
the end of the Thirty Years War, resemble 
war photojournalism in their matter-of-fact 
portrayal of brutality. 

Death gets a more informative treatment 
across town, at the Museum of London's fas- 
cinating Doctors, Dissection and Resurrection 
Men. If real-estate tycoons are the primary 
beneficiaries of London's redevelopment 
boom, archaeologists come in a solid sec- 
ond; they often gain access to long-buried 
historical sites when new construction peels 
back a layer of the city’s past. (Nature last 
year published the genome of the bacterium 
responsible for the Black Death, collected 
from bones excavated by Museum archae- 
ologists in a fourteenth-century plague pit.) 

In 2006, Museum of London archaeolo- 
gists unveiled a nearly 200-year-old cem- 
etery adjacent to the Royal London Hospital 
in Whitechapel. Their excavation revealed 
graves containing jumbles of bones from 
many people, as well as the odd turtle and cow, 
showing signs of amputation and dissection. 

With the surgical profession on the rise 
in the early nineteenth century, medical 
students at private anatomy colleges needed 
cadavers for study. Legitimate sources — 
executed prisoners — were scarce, so anat- 
omy schools sought the services of the grave 
robbers known as ‘resurrection mer. The 
text displays at the exhibition are enriched 
with a range of journal clippings, letters and 
other primary sources. 

Resurrection men could earn a handsome 
salary for digging up one body, and some 
even turned to murder. The slaying of an 
Italian boy and the trial of his murderers is 
told in detail, through video and documents. 
The killers were convicted and executed, and 
their bodies given over for dissection. The 
episode sparked widespread public revul- 
sion and led to the passage of the Anatomy 
Act of 1832. This allowed the “unclaimed 
and friendless” bodies of indigent hospital 
patients to be used for dissection. 

Among a sometimes intrusive welter of 
videos and interactive displays is the thought- 
ful The Body Beyond Death (2012), in which 
Londoners give their views on mortality. One, 
a young woman in a headscarf, explains why 
she would not want to donate her organs — 
poignantly showing how, for some, the dead 
are much more than a pile of bones. m 


Ewen Callaway is a reporter for Nature. 
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Lichens under threat 
from ash dieback 


The fungal pathogen Chalara 
fraxinea is killing ash trees 
(Fraxinus excelsior) throughout 
Europe. Also potentially under 
threat is the large diversity of 
lichens that these ash trees 
support. 

Using data from the UK 
National Biodiversity Network 
(www.nbn.org.uk), we found 
536 lichen species (corresponding 
to some 30% of UK lichens) that 
occur on ash. Of these, 84 are 
categorized as under threat in 
Britain using International Union 
for Conservation of Nature 
standards. 

For at least six of these 
threatened species, more than half 
of the records in the database are 
for specimens found on ash trees. 
This includes Fuscopannaria 
ignobilis, a lichen that receives the 
highest UK legislative protection 
status under Schedule 8 of the 
1981 Wildlife and Countryside 
Act, and Wadeana dendrographa, 
for which the United Kingdom 
has international conservation 
responsibility. 

Ash, along with non-native 
tree species such as sycamore 
(Acer pseudoplatanus), provided 
an alternative host for lichens 
affected by the catastrophic 
decline of elm trees during the 
1970s. If the UK ash population 
succumbs to dieback, the 
rescue effect for lichens is a 
consideration that should 
influence landscape management 
of non-native trees. 

Christopher J. Ellis, Brian 

J. Coppins, Peter M. 
Hollingsworth Royal Botanic 
Garden Edinburgh, UK. 
c.ellis@rbge.org.uk 


NIH chimps: don’t 
sell sanctuary short 


You quote Kathy Hudson, deputy 
director for science, outreach 
and policy at the National 
Institutes of Health (NIH), as 
saying: “In a perfect world, we 
would absolutely like to move 


all of the [retiring laboratory] 
chimps directly to Chimp Haven 
[sanctuary]” (Nature 491, 18; 
2012). This fails to acknowledge 
the NIH’s legal mandate to do 
just that. 

Since the passage of 
the Chimpanzee Health 
Improvement Maintenance 
and Protection (CHIMP) Act 
12 years ago, the NIH has done 
less than right by the law and by 
its research chimps in allowing 
labs to make discretionary 
decisions about the animals’ 
retirement. It claims not to have 
sufficient funding to provide 
housing for retired chimps at 
the federal sanctuary, while 
financing the expansion of labs 
to accommodate chimps. 

The CHIMP Act obliges the 
NIH to provide lifetime care for 
retired chimps. The agency’s 
fiscal-reserve cap does not 
restrict it from finding more 
funds to fulfil its mandate, or 
limit its responsibility to do so. 
Meanwhile, the federal sanctuary 
should not be going short. 
Theodora Capaldo New 
England Anti-Vivisection Society, 
Boston, Massachusetts, USA. 
theodoracapaldo@neavs.org 


NIH chimps: use 
existing facilities 


Louisiana's Chimp Haven 
sanctuary for retired laboratory 
chimpanzees is requesting a 
further US$2.55 million from 
the National Institutes of Health 
(NIH) to construct housing 
for 110 chimps about to retire 
from the New Iberia Research 
Center, part of the University of 
Louisiana at Lafayette (Nature 
491, 18; 2012). However, the 
sanctuary has almost hit the 
NIH funding cap of $30 million, 
and other facilities are 
already available at the Texas 
Biomedical Research Institute’s 
Southwest National Primate 
Research Center (SNPRC), of 
which I am director. 

The housing at the SNPRC 
is cost-effective, high quality 
and similar to some housing at 
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Chimp Haven. The SNPRC also 
provides extensive enrichment 
programmes for the animals. 

Besides housing, the NIH 
needs to consider the pressing 
health-care needs of ageing 
chimps and the capacity of 
institutions to meet them. 

The SNPRC’s medical 
capabilities are state-of-the-art. 
For example, four veterinarians 
are employed for 141 chimps, 
compared with one for 
130 chimps at Chimp Haven. 
Veterinarians at the SNPRC 
have 90 years’ combined 
experience working with 
chimps. And the SNPRC has an 
on-site pathology lab equipped 
for testing within minutes of a 
medical emergency, a facility 
not available at Chimp Haven. 

Given that the vacant 
facilities at the SNPRC were 
partly funded by a $1.5-million 
grant from the NIH, the 
proposed allocation of scarce 
federal research dollars to 
duplicate them at Chimp 
Haven seems wasteful. The 
Congressional Budget Office 
has estimated that the cost to 
the taxpayer of transferring 
the New Iberia animals and 
330 other NIH-owned chimps 
to Chimp Haven would be 
$56 million over the next 
5 years alone. 

John L. VandeBerg Texas 
Biomedical Research Institute 
and Southwest National Primate 
Research Center, San Antonio, 
Texas, USA. 
jlv@txbiomedgenetics.org 


NIH chimps: Texas 
lab is not a sanctuary 


Asa physician who provided 
testimony for the US Institute 
of Medicine report that found 
chimpanzee experiments to 
be scientifically unnecessary, 
Iam thrilled that the National 
Institutes of Health has declared 
some 110 of its chimps ineligible 
for research (Nature 491, 18; 
2012). 

But sending any of these 
chimps to the Texas Biomedical 


Research Institute in San 
Antonio, even for the short term, 
is no retirement. Laboratories are 
designed to facilitate research, 
not to provide high-quality, long- 
term care for chimps, which 
are cognitively, socially and 
emotionally complex animals. 
Furthermore, the Texas 
Biomedical Research Institute 
was fined more than US$25,000 
by the US Department of 
Agriculture in December 2011 
for violations of the Animal 
Welfare Act after three animals 
escaped from their cages (see 
go.nature.com/dnzsqa). 
John Pippin Physicians 
Committee for Responsible 
Medicine, Washington DC, USA. 


jpippin@pcrm.org 


Cuts endanger young 
scientists in Europe 


The Young Academy of Sweden 
has joined forces with the Young 
Academies of Germany, the 
Netherlands and Denmark to 
urge the leaders of the European 
Union to invest more, not less, 
in science in their upcoming 
budget (see go.nature.com/ 
ymjole). Short-term savings 
would have long-term costs and 
weaken Europe’ future scientific 
standing. 

The academies’ members 
are especially concerned that 
cuts could target the European 
Research Council (ERC), which 
has emerged as a funding model 
for junior researchers in high- 
profile international science. 

The ERC provides these 
young scientists with the 
funding to develop their own 
lines of research, rather than 
relying on the patronage ofa 
senior colleague. This strategy 
encourages promising scientists 
to stay in Europe and others 
to return from abroad. It also 
attracts research talent from 
around the world. 
Christian Broberger, Anna 
Sjéstrém Douagi The Young 
Academy of Sweden, Stockholm, 
Sweden. 
asd@sua.kva.se 
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COMPUTATIONAL MATERIALS SCIENCE 


Substitution with vision 


A method has been developed for predicting the stability and elasticity of certain alloys for millions of atomic configurations 
of the materials. This approach should help to identify materials with optimized properties. SEE LETTER P.740 


GUS L. W. HART 


arly civilizations began alloying copper 
Be arsenic, tin or zinc nearly 6,000 

years ago’, ushering in the Bronze Age. 
Even when more-abundant iron became a 
mainstay during the Iron Age, substitutional 
alloys of copper — in which some of the copper 
atoms were replaced with atoms of a different 
metal — were superior materials. So Roman 
foot soldiers were equipped with wrought-iron 
weapons, whereas Roman officers had swords 
made of bronze. Similarly, at the turn of the 
twentieth century, French scientists developed 
a steel that substituted some iron with vana- 
dium. This alloy, which had three times the 
tensile strength of competing steels, became 
an essential ingredient of the venerable Ford 
Model T (and of early French luxury cars). 

Materials substitution — the replacement of 
some of the atoms of a material with those of 
another (Fig. 1) — continues to be a key strat- 
egy for developing the materials of tomorrow, 
but predicting the properties of new alloys is 
remarkably difficult. On page 740 of this issue, 
Maisel et al.’ report a method for calculating 
from first principles both the elasticity and the 
thermodynamic stability of alloys*. 

The difficulty of developing improved 
materials is a bottleneck — perhaps the main 
bottleneck — to advances in new technologies. 
In 2011, to increase the pace of materials devel- 
opment, and to leverage impressive advances 
in computational materials science, US Presi- 
dent Barack Obama announced the Materials 
Genome Initiative, a project that aims to create 
an infrastructure of informatics and experi- 
mental tools for materials development in the 
United States’. Materials substitution is central 
to this initiative and has also been specifically 
referred to in recent calls for research proposals 
from US federal funding agencies. 

Many computational approaches have been 
developed to take advantage of the materials- 
substitution strategy, but Maisel and col- 
leagues’ report brings something new to such 
efforts. Their work is noteworthy because it 
demonstrates a clear correlation between the 
thermodynamic and elastic properties of alloys 
known as face-centred-cubic intermetallics, 


*This article and the paper under discussion’ were 
published online on 21 November 2012. 


Figure 1 | Atomic configurations. Substitutional alloys are formed when a fraction of the atoms of a metal 
are replaced with different atoms. Several atomic configurations of possible alloys are depicted here for a 
hypothetical case in which the atoms of the main metal (orange) form a square lattice. Substituted atoms 
are shown in black. Maisel et al.’ report a computational technique that allows rapid prediction of both the 
elastic stiffness and the thermodynamic stability of different atomic configurations of certain alloys. 


and because their approach can easily be 
applied to other alloy types. It therefore not 
only provides a fundamental understanding 
of the physical properties of materials, but also 
opens up opportunities for materials engineer- 
ing. Using the authors’ method, it may be pos- 
sible to tune the elastic stiffness of alloys using 
materials substitution. For example, alloys 
could be softened to make compounds for 
orthopaedic implants that integrate well with 
bone, in order to avoid the difficulties that 
arise when bone and implant materials have 
disparate elastic properties. 

The authors’ technique extends a com- 
putational methodology known as cluster 
expansion that is often used to calculate the 
properties of substitutional alloys. Cluster 
expansion involves two basic steps: first, cal- 
culate the target property for a number of dif- 
ferent atomic arrangements using quantum 
mechanics; and second, map this information 
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onto a simple model that accounts for the 
effects of atomic substitutions. In this way, 
one essentially ‘trains’ a computational model, 
which is then used to calculate the target prop- 
erty for any atomic configuration — instantly 
and with quantum-mechanical accuracy. 
Because computation of the target quantity is 
so efficient, cluster expansion can be used to 
simulate thermodynamic and kinetic proper- 
ties of atomic ensembles from first principles, 
and to screen hundreds of millions of atomic 
configurations for a specific property. 

Maisel and colleagues report two major 
advances in cluster expansion. First, they 
have expanded its use to calculate multiple 
properties in a single model; and second, 
they have used it to identify a specific relation- 
ship between those properties. Specifically, 
they combined two cluster expansions to 
predict both the thermodynamic stability 
and the mechanical stiffness of any atomic 


configuration in their target alloys. This 
revealed that the more thermodynamically 
stable the configuration, the stiffer the result- 
ing material. 

Many researchers use computation to iden- 
tify atomic structures of a given material that 
have desirable properties, often with little 
regard for whether those structures are fea- 
sible to make (thermodynamically stable). 
Maisel and co-workers’ approach shows that, 
at least in the case of elastic stiffness, hunt- 
ing for metastable structures that have better 
properties than stable structures — whether 
known or predicted — is essentially futile, and 
that researchers should focus on other mater- 
ials instead. That said, being able to predict 
both the stability and another target property 
ofa material will allow scientists to efficiently 
scan through sets of hypothetical materials 
and ‘see’ promising candidates, lending vision 
to an established computational approach. 
It is in this discovery mode that Maisel and 
colleagues’ work could contribute greatly to 
efforts such as the Materials Genome Initiative. 


PLANT ECOLOGY 


It remains to be seen how many other 
materials’ properties will be studied using the 
new approach. In principle, any property that 
directly depends on atomic configuration is 
within reach, but many properties of relevance 
to engineering are still difficult to compute 
in practice. Furthermore, some of the most 
important properties of materials depend not 
only on the atomic configuration of a fixed 
lattice, but also on microstructural elements 
— such as boundaries between microscopic 
crystals (grains), grain sizes and extended crys- 
tal defects. These remain beyond the reach of 
quantum-mechanical calculations. 

Still, high-throughput approaches*” for 
sifting through thousands, or tens of thou- 
sands, of candidate materials are poised to 
make a substantial contribution to soci- 
ety’s needs by generating large databases of 
information that will be of use to research- 
ers®’. These databases will be more effec- 
tive if the information they contain about 
physical properties is used to build computa- 
tional models that, in turn, could search for 


Forests on the brink 


An analysis of the physiological vulnerability of different trees to drought shows 
that forests around the globe are at equally high risk of succumbing to increases 


in drought conditions. SEE LETTER P.752 


BETTINA M. J. ENGELBRECHT 


ater is the most limiting factor for 
ecosystem diversity and produc- 
tivity worldwide. But the global 
climate is changing, and both warming and 
shifts in rainfall patterns are projected, which 
will leave large areas of the planet with less rain 
and a higher likelihood of extreme drought 
events’”. These changes will almost certainly 
affect forests, which cover more than 30% of 
the world’s land surface. Understanding these 
effects is imperative: forests play an integral 
part in carbon and water cycles, they provide 
timber and other products, and they are home 
to a vast diversity of plants, animals and micro- 
organisms. But forests occur in a wide range of 
climatic conditions, so it is a challenge to pre- 
dict how the vulnerability of trees to changes in 
water availability compares between different 
biomes. In this issue, Choat et al.’ (page 752) 
use a combination of physiological measure- 
ments of the vulnerability of trees to drought 
and of the drought stress they actually experi- 
ence in their natural habitats to show that for- 
ests worldwide are at high risk*. 
We might expect that trees in forests 


*This article and the paper under discussion? were 
published online on 21 November 2012. 


currently exposed to seasonal or multi-annual 
droughts, such as in “Mediterranean-type’ 
systems, are already well adapted and will 
therefore suffer less from an increase in 
drought conditions than trees in wet forests. 
Conversely, but equally reasonably, we could 
predict that trees in dry areas are already at 
their physiological limits and would therefore 
be more vulnerable to increased drought than 
trees in wet forests. To investigate these ques- 
tions, Choat and colleagues compared the vul- 
nerability of the tree water-transport system to 
drought in different species worldwide. 

In plants, water is transported through a 
tubing system, a tissue called xylem that is 
made up of a multitude of conduits. Loss of 
water vapour (transpiration) through stomata 
(pores) in the plants’ leaves generates suction 
that pulls water in the xylem from the soil 
through the roots and stem to the leaves — 
much like sucking water through a straw. On 
its way, the water provides crucial services 
to the plant: it is the medium for metabolic 
reactions, it transports nutrients and other 
substances, and it provides stability. However, 
the powerful suction that pulls water through 
the xylem brings with it the risk of pulling air 
through small holes, called pit pores, in the 
sides of the conduits. These air bubbles can 
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thermodynamically stable materials to meet a 
particular need’®, Maisel and colleagues’ work, 
coupled with automated model-building 
methods’, might help us to achieve that goal. m 
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block the xylem and impair water transport, 
just like sucking air into a broken straw. This 
process is called xylem embolism, and the 
higher the suction in the conduit, the more 
embolism occurs. 

The link between this physiology and 
drought conditions comes from the fact that 
suction increases with increasing transpiration 
and/or decreasing water availability in the soil. 
Plants can regulate their stomata to delay the 
increase in suction, but if water is not replen- 
ished, more and more conduits will become 
clogged, leading to hydraulic failure and the 
eventual death of the plant. However, different 
plant species have different xylem structures, 
so the vulnerability of a plant’s xylem con- 
duits to embolism, and therefore its ability to 
tolerate drought, are variable. 

The authors compiled data on the xylem 
vulnerability of 480 tree species from 183 sites 
worldwide, covering the broad range of 
climatic conditions in which forests occur. 
They included both angiosperms (flowering 
trees, such as oak and maple) and gymno- 
sperms (such as pine and cedar), which vary 
substantially in their xylem structure. Wher- 
ever the data were available, they also included 
the maximum suction occurring in the trees in 
their natural habitats. Combining these data 
enabled Choat et al. to explore how the suction 
that induces hydraulic failure in a given species 
compares with the suction that it actually expe- 
riences. If these values are close together, this 
represents a small ‘safety margin’ with respect 
to hydraulic failure and indicates that the 
species is at risk; if they are far apart, the species 
is likely to be able to withstand more intense 
drought conditions. 

The data show that, as expected, trees grow- 
ing in more arid conditions around the globe 
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50 Years Ago 


It seems to be generally agreed that 
the standard of self-expression in 
spoken and written English among 
sixth-form and undergraduate 
scientists and technologists is low. 
Various causes have been blamed 

... but in all the diagnoses and cures 
Ihave seen so far, all the emphasis 
has been on past failures by English 
experts and future remedies to 

be administered by other English 
experts. It is not my intention to 
dissociate English teachers from 
the problem altogether ... but I 
want to suggest that scientists and 
technologists themselves must take 
most of the responsibility for the low 
standards of self-expression in their 
professions, and that a major change 
of outlook on their part is the only 
thing that can bring a substantial 
improvement in the situation. 
From Nature 1 December 1962 


100 Years Ago 


Biologische und morphologische 
Untersuchungen tiber Wasser- und 
Sumpfgewéichse. By Prof. H. Gliick 
— Prof. Gliick has produced a 
portentous volume on the riparian 
flora, forming the third instalment 
of his work on water and swamp 
plants. Frankly, we do not find 
justification for the 600 or more 
pages of his book, and we fancy 
most readers who have been in 

the habit of using their eyes when 
observing or collecting plants will 
find but little to reward them for 
the trouble of its perusal ... No 
doubt a work of this kind possesses 
some value, but, as it appears to us, 
it excellently illustrates the truth 
of the saying that the secret of 
dullness lies in the attempt to write 
all one knows. Prof. Gliick gives 
the impression (perhaps unjustly) 
that he has written all he knows 
about his subject, and certainly he 
has jotted down a good deal that is 
already very familiar to others. 
From Nature 28 November 1912 


Figure 1 | Thirsty trees. Reports of drought-induced forest die-off*, such as that in Switzerland in 1999 
shown here, have increased in recent decades, suggesting that climate change is already having an impact 
on tree health in many locations. Choat and colleagues’ study’ of trees across the globe suggests that they 
are at high risk from even small increases in drought intensity. 


are better at withstanding xylem embolism. 
The exciting finding, however, is that angio- 
sperm trees in all forest biomes have converged 
on a risky strategy, operating at extremely 
narrow safety margins. This implies that 
these trees are already, under current condi- 
tions, on the verge of injurious levels of water 
availability, and that even a minor increase in 
drought intensity will induce levels of xylem 
embolism that will impair growth and lead 
to tree death. 

The suggestion that all forests are on the 
brink of succumbing to drought, and may 
already be responding to climate change, 
is supported by observations of increased 
drought-induced forest die-offs and tree 
mortality in many ecosystems* (Fig. 1). For 
gymnosperms, Choat et al. found wider safety 
margins, suggesting that these trees may 
have a higher tolerance to increased drought. 
However, even these trees are threatened by 
hydraulic failure, as recent regional die-offs 
of pines show’. Taken together, these studies 
sound a warning bell that we can expect to 
see forest diebacks become more widespread, 
more frequent and more severe — and that no 
forests are immune. The ramifications of this 
scenario are diverse and, in many respects, 
dire: forest mortality will be accompanied by 
changes in species composition, changes in 
ecosystem function and losses of services and 
biodiversity’. 

Advancing our knowledge of organismal 
responses to factors such as drought and 
temperature is essential to improving predic- 
tions of the consequences of climate change”. 
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Through their meta-analysis of the global dis- 
tribution of xylem vulnerability, Choat et al. 
have dramatically increased our understand- 
ing of the comparative vulnerability of forests. 
Nevertheless, the mechanisms that actually 
lead to drought-induced tree mortality still 
remain elusive; in fact, it is known that some 
species can survive complete hydraulic failure 
for extended periods of time’. Although many 
studies have assessed the response of plants 
to experimentally manipulated precipitation 
and/or temperature’, the results of these stud- 
ies do not lend themselves to comparisons of 
drought responses across biomes, because of 
differences in treatments and in the resulting 
drought intensities. A coordinated network 
of standardized experiments is needed to fur- 
ther advance understanding of climate-change 
responses in ecosystems worldwide. 

Our ability to forecast the consequences 
of drought for forests is also limited by the 
high regional uncertainty in current models 
for rainfall and drought prediction, for both 
long-term trends and extreme events’”. A 
fundamental lesson from Choat and col- 
leagues’ study is that even small changes in 
drought intensity can be expected to lead 
to mortality in forests all over the world. 
This only highlights the urgent need for 
climate models that return more-confident 
predictions. m 
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Magma chambers 
on aslow burner 


An assessment of crystallization processes occurring in magma chambers in the 
ocean floor finds an unexpected enrichment in trace elements, reviving an old 
theory of the cycling of magma in these chambers. SEE ARTICLE P.698 


ALBRECHT W. HOFMANN 


he world’s ocean basins are constantly 

being regenerated by an 80,000- 

kilometre-long volcanic system of 
mid-ocean ridges, where Earth’s mantle melts 
to form magma that eventually produces the 
basaltic floor of the oceans. The composi- 
tion of ocean-floor basalts is one of the main 
sources of information about Earth’s deeper 
interior. On page 698 of this issue, O’Neill and 
Jenner’ re-examine the chemical composi- 
tions of basaltic lavas from this global mag- 
matic system. They find new, and remarkably 
systematic, chemical relationships between 
the concentrations of ‘incompatible’ trace 
elements (so named because they are largely 
excluded from magmatic crystals) and that of 
magnesium oxide (MgO). 

As expected, the content of incompatible 
elements increases in the basaltic-liquid 
component (the melt) of magmas, because 
MgO-bearing crystals precipitate in sub- 
oceanic magma chambers (reservoirs), 
causing the MgO content of the liquid to 
decrease. But O’Neill and Jenner show that 
the observed incompatible-element increase 
is much greater than conventional crystalliza- 
tion processes can explain. Their proposed 
solution to this dilemma would require a 
revision in the way geochemists calculate 
the composition of parental magmas enter- 
ing magma chambers, and therefore also the 
composition of the mantle rocks from which 
these magmas are derived. 

When basaltic lava comes into contact with 
cold sea water, it is chilled into glass. Geo- 
chemists like to analyse such glasses because 
they preserve the chemical composition of the 
lava particularly well, and they can thus tell the 


researchers much about the composition of 
the underlying mantle in which the melt forms. 
However, this view of the mantle is blurred 
because there are several intervening stages 
between melt formation and the eruption of 
lava. These are: partial melting of the mantle 
at depth (greater than about 30km); extraction 
of the melt from the partially molten mush; its 
emplacement in shallow magma chambers; the 
formation and settling out of magmatic crys- 
tals in these chambers; and, finally, eruption of 
the remaining liquid on the ocean floor. 

Two fundamentally opposing views of 
the mantle composition inferred from these 
glasses have prevailed over the past 40 years. 
One holds that the mantle has an essentially 
uniform composition, and that the composi- 
tional variability of the erupted basaltic lavas is 
produced primarily by processes occurring in 
the shallow magma chambers. The other view 
holds that magma-chamber processes have 
only minor effects on the erupted lavas that can 
be easily corrected for, and that the variations 
in lava composition mainly reflect differences 
in the composition of the mantle source and in 
the specifics of the melting process. 

This latter view has gradually gained the 
upper hand, because much of the observed 
chemical variability of the lavas correlates 
with variations in the isotopic composition 
of the elements strontium, neodymium, 
hafnium and lead. These elements are the 
products of very slow radioactive decay, and 
therefore accumulate only during long resi- 
dence times in the mantle. The observed dif- 
ferences in isotopic composition can therefore 
not be produced in short-lived magma cham- 
bers, but require long-term differences in 
parent-daughter ratios in the (mantle) source 
of the melts. 
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A crucial requirement when going back- 
ward from observed compositions of erupted 
basalts to their mantle sources is to evaluate 
the effects of partial crystallization and loss 
of the crystals in magma chambers. This is 
widely assumed to involve ‘fractional crystal- 
lization, whereby newly formed crystals are 
immediately removed from chemical inter- 
action with the liquid. Laboratory experiments 
have shown that the crystallization process 
in ocean-ridge magma chambers invariably 
involves the magnesium-bearing mineral 
olivine. The net effect of this is that the MgO 
content of the liquid progressively decreases 
as freshly crystallized olivine is removed from 
the liquid, whereas there is an increase in the 
contents of incompatible trace elements (such 
as barium, thorium and neodymium) because 
they are excluded from the crystals. 

This was thought to be well understood — 
until O’Neill and Jenner plotted the incom- 
patible-element contents against MgO for 
two recently assembled global data sets*”. 
They found excellent linear correlations 
(with the expected negative slopes) between 
incompatible-element and MgO contents. 
However, they were startled to find that these 
slopes are consistently greater than the maxi- 
mum allowed from fractional-crystallization 
theory. 

If fractional crystallization does not explain 
this effect, what process does? One possibility 
is that lavas that have higher incompatible- 
element contents start out with systemati- 
cally lower parental MgO contents. But that 
would mean that the sources of these magmas 
could not contain olivine, even though this is 
the most common of all upper-mantle miner- 
als. Nevertheless, it has been proposed’ that 
some mid-ocean-ridge basalts are mixtures of 
liquids formed from peridotite, the ‘standard’ 
olivine-bearing mantle rock, and other liquids 
formed from eclogite or pyroxenite, which are 
olivine-free rocks that form from subducted, 
recycled oceanic basalts. Melts from such 
recycled basalts should also have a higher- 
than-normal content of incompatible elements 
anda lower-than-normal MgO content. Such 
recycled basalts should also have different 
isotopic compositions of neodymium, for 
example. However, the expected correlations 
between neodymium isotopes and MgO have 
not been documented for any global set of 
ocean-ridge basalts. 

As a way out of the dilemma, O'Neill and 
Jenner revive and generalize a model that 
was originally proposed by O’Hara’ and 
later modified by Albaréde’, but which has 
been mostly forgotten. This model envis- 
ages a magma chamber that is periodically 
refilled with fresh parental liquid from below. 
The fresh liquid mixes with the pre-existing 
liquid, and the mixture is tapped by a volcano, 
whereupon crystallization resumes. This ‘trick’ 
of replenishment with fresh parental magma 
keeps the MgO content of the liquid from 
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falling too rapidly and allows a much greater 
build-up of incompatible-element concen- 
trations in the residual liquid than would 
be possible with closed-system fractional 
crystallization. 

At first sight, these effects might seem to be 
of interest mostly to aficionados of the details 
of magma-chamber processes. But they imply 
a much-reduced role for chemical heterogeneity 
of the mantle, as well as for the effects of partial 
melting, because most of the incompatible- 
element variability is now ascribed to processes 
occurring in the magma chambers. 

How plausible is this model? O’Neill and 
Jenner propose that a global assemblage of 
magma chambers exists, in which crystalli- 
zation processes vary locally, but which as an 
ensemble conform to the O’Hara—Albarede 
model. Although this model is apparently 
quite successful in describing the global obser- 
vations, the question of why these locally vari- 
able crystallization processes should average 


out to this idealized model remains a mystery. 

The authors have tested their model by 
comparing predicted and measured partition 
coefficients of incompatible trace elements 
for crystals forming in magma chambers 
(the partition coefficient is the concentration 
of an element in a crystal divided by its con- 
centration in the liquid). For the most part, 
the agreement is impressive, but barium and 
potassium are significant exceptions. These 
elements behave like highly incompatible 
elements in the basalts. In other words, their 
partition coefficients should be close to zero, 
which is actually the case in mantle minerals. 
But their experimentally determined parti- 
tion coefficients in plagioclase (one of the 
main minerals that form in shallow magma 
chambers) are high enough to raise questions 
about the model. 

Clearly, further experimental work is 
needed to resolve these issues. In the mean- 
time, O’Neill and Jenner’s paper indicates the 


Decoding our 


daily bread 


The wheat genome is large and complex, and has defied complete sequencing. 
But the most comprehensive analysis so far of the plant’s genes will support 
efforts to optimize the supply of this vital food crop. SEE LETTER P.705 


PETER LANGRIDGE 


he bread wheat genome presents a 

significant challenge to researchers. At 

17 gigabases, it is about six times the 
size of the human genome, and it is hexaploid, 
meaning that it contains six sets of chromo- 
somes, which derive from three different 
genomes. So why bother to sequence such a 
difficult genome? Wheat is arguably the most 
important plant to humans. Bread wheat 
(Triticum aestivum) is the world’s most widely 
grown crop, covering more than 200 mil- 
lion hectares of land’ throughout temperate, 
Mediterranean-type and subtropical regions 
of both the Northern and Southern hemi- 
spheres. Although total production of wheat — 
681 million tonnes in 2011 (ref. 1) — is slightly 
lower than that of maize (corn) and rice, wheat 
is the primary carbohydrate and protein source 
for the world’s population’. For this reason, 
researchers around the world are tackling the 
challenge of the plant’s genome. On page 705 
of this issue, Brenchley et al? present a detailed 
analysis and assembly of wheat gene sequences 
that will provide a key resource for crop 
scientists. 


Systematic wheat breeding began around 
100 years ago, but farmers’ improvement of 
wheat strains by selective breeding can be 
traced back to the beginnings of agriculture 
almost 10,000 years ago’. The ‘Green Revo- 
lutior of the 1960s — a series of advances in 
agricultural research, technology and infra- 
structure — triggered a drastic improvement 
in wheat yields. However, wheat production 
has struggled to meet global demand, and an 
increasingly variable and unstable climate is 
adding to the problems of wheat supply. It 
has been calculated that wheat production 
must increase by about 60% by 2050 to meet 
predicted demand". This is a daunting chal- 
lenge, but one that is taken seriously by the 
international community, as emphasized by 
the recent decision of the G20 group of coun- 
tries to establish the international Wheat 
Initiative, designed to develop resources and 
capabilities to target wheat improvement®. A 
key objective of this initiative is to establish 
genomics resources so that new breeding tech- 
nologies can be effectively and rapidly applied 
to wheat (Fig. 1). 

The wheat genome is not only large, but 
also complex. It contains extensive stretches 
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need for a re-examination of the nature of a 
magmatic process that is volumetrically by 
far the most significant on Earth. An accu- 
rate assessment of the crystallization process 
is needed to infer the composition of the 
mantle from which the ocean-floor basalts 
are derived. m 
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omen 


of repetitive non-coding DNA, which make 
sequence assembly difficult. Where there are 
genes, it is often hard to differentiate between 
the three constituent genomes, because each 
has a related set of genes. And the order of the 
genes has been partly shuffled on several of 
the chromosomes, adding to the complexity. 
There are three strategies that can be adopted 
to generate and assemble a full sequence for 
such a problematic genome. One approach 
is to make clone libraries that contain long 
stretches of DNA (more than 100,000 bases in 
each clone) derived from each wheat chromo- 
some arm — there are 21 chromosomes (seven 
from each genome), giving 42 chromosome 
arms. The clones can be used to construct an 
overlapping series of DNA segments to produce 
a minimum tiling path for sequence assembly. 
This strategy is feasible because wheat is highly 
tolerant of chromosome changes and wheat 
lines are available in which each chromosome 
arm is present as a separate telosomic chromo- 
some’®, which can be readily separated from the 
rest of the genome. Groups from around the 
world are working on this project as part of the 
International Wheat Genome Sequencing 
Consortium’. 

The second approach is to assemble the 
sequences of the three diploid genomes that are 
the progenitors of the wheat genome, which is 
broken down into A, B and D genomes. The 
progenitor species of the A and D genomes 
are known to be Triticum urartu and Aegilops 
tauschii, respectively, and sequence information 
is available. The progenitor of the B genome is 
believed to have been a close relative of Aegilops 
speltoides, and sequence information can be 
obtained from this species or derived from the 
tetraploid wheat species Triticum durum, which 
carries the A and B genomes’. 

‘Shotgun sequencing’ is the third approach. 
Unlike the large-clone process, which uses 
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falling too rapidly and allows a much greater 
build-up of incompatible-element concen- 
trations in the residual liquid than would 
be possible with closed-system fractional 
crystallization. 

At first sight, these effects might seem to be 
of interest mostly to aficionados of the details 
of magma-chamber processes. But they imply 
a much-reduced role for chemical heterogeneity 
of the mantle, as well as for the effects of partial 
melting, because most of the incompatible- 
element variability is now ascribed to processes 
occurring in the magma chambers. 

How plausible is this model? O’Neill and 
Jenner propose that a global assemblage of 
magma chambers exists, in which crystalli- 
zation processes vary locally, but which as an 
ensemble conform to the O’Hara—Albarede 
model. Although this model is apparently 
quite successful in describing the global obser- 
vations, the question of why these locally vari- 
able crystallization processes should average 


out to this idealized model remains a mystery. 

The authors have tested their model by 
comparing predicted and measured partition 
coefficients of incompatible trace elements 
for crystals forming in magma chambers 
(the partition coefficient is the concentration 
of an element in a crystal divided by its con- 
centration in the liquid). For the most part, 
the agreement is impressive, but barium and 
potassium are significant exceptions. These 
elements behave like highly incompatible 
elements in the basalts. In other words, their 
partition coefficients should be close to zero, 
which is actually the case in mantle minerals. 
But their experimentally determined parti- 
tion coefficients in plagioclase (one of the 
main minerals that form in shallow magma 
chambers) are high enough to raise questions 
about the model. 

Clearly, further experimental work is 
needed to resolve these issues. In the mean- 
time, O’Neill and Jenner’s paper indicates the 


Decoding our 


daily bread 


The wheat genome is large and complex, and has defied complete sequencing. 
But the most comprehensive analysis so far of the plant’s genes will support 
efforts to optimize the supply of this vital food crop. SEE LETTER P.705 


PETER LANGRIDGE 


he bread wheat genome presents a 

significant challenge to researchers. At 

17 gigabases, it is about six times the 
size of the human genome, and it is hexaploid, 
meaning that it contains six sets of chromo- 
somes, which derive from three different 
genomes. So why bother to sequence such a 
difficult genome? Wheat is arguably the most 
important plant to humans. Bread wheat 
(Triticum aestivum) is the world’s most widely 
grown crop, covering more than 200 mil- 
lion hectares of land’ throughout temperate, 
Mediterranean-type and subtropical regions 
of both the Northern and Southern hemi- 
spheres. Although total production of wheat — 
681 million tonnes in 2011 (ref. 1) — is slightly 
lower than that of maize (corn) and rice, wheat 
is the primary carbohydrate and protein source 
for the world’s population’. For this reason, 
researchers around the world are tackling the 
challenge of the plant’s genome. On page 705 
of this issue, Brenchley et al? present a detailed 
analysis and assembly of wheat gene sequences 
that will provide a key resource for crop 
scientists. 


Systematic wheat breeding began around 
100 years ago, but farmers’ improvement of 
wheat strains by selective breeding can be 
traced back to the beginnings of agriculture 
almost 10,000 years ago’. The ‘Green Revo- 
lutior of the 1960s — a series of advances in 
agricultural research, technology and infra- 
structure — triggered a drastic improvement 
in wheat yields. However, wheat production 
has struggled to meet global demand, and an 
increasingly variable and unstable climate is 
adding to the problems of wheat supply. It 
has been calculated that wheat production 
must increase by about 60% by 2050 to meet 
predicted demand". This is a daunting chal- 
lenge, but one that is taken seriously by the 
international community, as emphasized by 
the recent decision of the G20 group of coun- 
tries to establish the international Wheat 
Initiative, designed to develop resources and 
capabilities to target wheat improvement®. A 
key objective of this initiative is to establish 
genomics resources so that new breeding tech- 
nologies can be effectively and rapidly applied 
to wheat (Fig. 1). 

The wheat genome is not only large, but 
also complex. It contains extensive stretches 
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need for a re-examination of the nature of a 
magmatic process that is volumetrically by 
far the most significant on Earth. An accu- 
rate assessment of the crystallization process 
is needed to infer the composition of the 
mantle from which the ocean-floor basalts 
are derived. m 
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of repetitive non-coding DNA, which make 
sequence assembly difficult. Where there are 
genes, it is often hard to differentiate between 
the three constituent genomes, because each 
has a related set of genes. And the order of the 
genes has been partly shuffled on several of 
the chromosomes, adding to the complexity. 
There are three strategies that can be adopted 
to generate and assemble a full sequence for 
such a problematic genome. One approach 
is to make clone libraries that contain long 
stretches of DNA (more than 100,000 bases in 
each clone) derived from each wheat chromo- 
some arm — there are 21 chromosomes (seven 
from each genome), giving 42 chromosome 
arms. The clones can be used to construct an 
overlapping series of DNA segments to produce 
a minimum tiling path for sequence assembly. 
This strategy is feasible because wheat is highly 
tolerant of chromosome changes and wheat 
lines are available in which each chromosome 
arm is present as a separate telosomic chromo- 
some’®, which can be readily separated from the 
rest of the genome. Groups from around the 
world are working on this project as part of the 
International Wheat Genome Sequencing 
Consortium’. 

The second approach is to assemble the 
sequences of the three diploid genomes that are 
the progenitors of the wheat genome, which is 
broken down into A, B and D genomes. The 
progenitor species of the A and D genomes 
are known to be Triticum urartu and Aegilops 
tauschii, respectively, and sequence information 
is available. The progenitor of the B genome is 
believed to have been a close relative of Aegilops 
speltoides, and sequence information can be 
obtained from this species or derived from the 
tetraploid wheat species Triticum durum, which 
carries the A and B genomes’. 

‘Shotgun sequencing’ is the third approach. 
Unlike the large-clone process, which uses 
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Figure 1 | Vital grains. Brenchley and colleagues’ detailed analysis’ of the bread wheat genome will help 
scientists to identify breeding strategies to optimize yield from this crucial crop. 


sizeable fragments to develop an approximate 
sequence ‘map, the shotgun approach relies on 
sequence overlap in large numbers of much 
shorter sequences to assemble a genome. A 
public resource of shotgun sequences for 
each chromosomal arm of the wheat genome 
is close to completion’. However, although 
sequencing technologies are improving rap- 
idly, such that shotgun sequencing of the entire 
hexaploid wheat genome is feasible, reliable 


BOX1 


assembly of these sequences from such a large 
genome is not yet possible. 

The strategy of using large-insert clones to 
produce chromosome-arm-specific data is 
expected to yield the best-quality sequence 
assembly, but this will be a relatively slow pro- 
cess. However, the different approaches are 
not mutually exclusive and can be combined 
in a single effort, as Brenchley and colleagues 
have done. The authors’ extensive sequencing 


Insight into the barley genome 


Anyone who likes a nip of whisky ora 

slurp of a malted milkshake will be pleased 
to know that researchers are also working 
towards preserving the world’s supply 

of barley. On page 711 of this issue, the 
International Barley Genome Sequencing 
Consortium presents a genomic analysis 
for cultivated barley!!, Hordeum vulgare L., 
the world’s fourth-most-abundant 

cereal crop. 

The barley genome is less unwieldy 
than that of wheat, being only diploid and 
5.1 gigabases in size. The authors used 
information from existing large-clone 
libraries to assemble a 4.98-gigabase 
physical map of the genome and then 
added data from whole-genome shotgun 
sequencing to this framework. Their 
analysis has identified more than 26,000 
‘high-confidence’ genes (those for which 
similar genes have previously been identified 


in at least one other plant). It also reveals that 
substantial post-transcriptional processing, 
including high rates of alternative splicing, 
occurs to regulate gene expression in barley 
—as seen in wheat. 

Barley is more stress tolerant than 
some other cereal plants, so it is able to 
grow comparatively well in harsh climates 
or on nutrient-poor soils. The genomic 
information may help to explain these 
resilience mechanisms, and facilitate the 
breeding of varieties that have optimal gene 
combinations for certain growth conditions. 
Plant breeders are also keen to identify 
genetic traits that might be exploited to 
further improve the dietary-fibre content of 
barley grain — which is already relatively 
high — because increased fibre intake may 
help to reduce the incidence of conditions 
such as type 2 diabetes and colorectal 
cancer. Marian Turner 
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led to the identification of between 94,000 
and 96,000 genes. They compared these 
genes with sequence data from the progenitor 
genomes, and were able to assign around two- 
thirds of the genes to the A, B or D genomes. 
The approach was tested using sequences 
from individual chromosome arms — the 
researchers used shotgun sequencing of the 
isolated group 1 chromosomes (1A, 1B and 
1D) to develop a set of sequences to ‘train the 
methods, then used them to assign sequences 
to specific genomes. The assignment of genes 
to the A, B or D genome is particularly valu- 
able to wheat researchers because it allows 
them to differentiate genes and DNA markers 
from each of the three genomes, a difficult and 
time-consuming process. 

Although Brenchley et al. have provided 
extensive sequence information, we are still 
a long way from having a complete wheat- 
genome assembly. However, the authors’ data 
form a framework to which results from shot- 
gun sequencing of other wheat varieties and 
from the chromosome-arm sequencing project 
can be added, and the reliability of the assem- 
bly will rise as more groups add their findings. 

Brenchley and colleagues’ sequence analysis 
also reveals the extent of wheat-genome flex- 
ibility. The researchers find that the formation 
of a hexaploid genome from three diploid 
progenitors has led to significant losses of 
members of many gene families, but also an 
expansion of other families, including those 
involved in plant metabolism and growth. 
These changes are likely to have been a key fac- 
tor in the success of wheat in so many regions 
and climatic zones. 

So will these findings give us clues to new 
strategies for wheat improvement? And can we 
harness the dynamism of the genome to gener- 
ate varieties better able to cope with a variable 
environment? Wheat is grown largely in envi- 
ronments in which yield is being undermined 
by biotic and abiotic stresses. In fact, although 
wheat yields can exceed 12 tonnes per hectare 
(ref. 9), the global average is below 3 tonnes 
per hectare, and in non-irrigated environments 
the average is below 2 tonnes per hectare’. 
Heat and drought stress are foreshadowed 
as the major challenges for future wheat 
production, and plant breeding offers the 
best approach for responding to these pres- 
sures’. The current and future advances in 
understanding the wheat genome, and the 
genomes of other crop plants (Box 1), are likely 
to hold the key to developing breeding strat- 
egies that will optimize yields under variable 
conditions. = 
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Vitamins prime 


immunity 


The finding that derivatives of vitamin B can bind to an antigen-presenting 
protein that stimulates specialized immune cells suggests a novel mechanism 
by which the immune system detects microbial infections. SEE ARTICLE P.717 


WEI-JEN CHUA & TED H. HANSEN 


development and metabolism, many 

vitamins have crucial roles in the immune 
system. The functions of two lipid-soluble 
vitamins, vitamin D (ref. 1) and vitamin A 
(ref. 2), in modulating immune responses are 
already known. But on page 717 of this issue, 
Kjer-Nielsen et al.’ suggest a very different 
immune function for vitamins B2 (riboflavin) 
and B9 (folic acid), which are water-soluble 
vitamins. The authors provide evidence that 
molecules produced when bacteria metabo- 
lize certain B vitamins can activate a class 
of immune T cell called mucosa-associated 
invariant T (MAIT) cells. The proposal that 
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Figure 1 | Modes of antigen presentation. a, Conventional CD4* and CD8* 
T cells bind to antigens presented by MHC molecules on the surface of other 
cells. These antigens are typically short chains of amino acids (peptides) 
derived from proteins. The small balls extending from the peptide are amino- 
acid side chains that anchor the peptide in the MHC or are detected by the 
T-cell receptor. An example of a peptide derived from influenza virus that 
stimulates CD8* T cells is depicted below the cells. b, Another class of 

T cell, called NKT cells, recognizes antigens derived from lipid molecules 
that are presented by cells expressing a molecule called CD1d, which has deep 


MAIT cells detect infected cells through 
vitamin metabolites attached to the cells’ sur- 
face is the first suggestion that vitamins can 
act as antigens (substances that activate T and 
B cells of the immune system), and boosts 
our understanding of this novel arm of the 
immune system. 

T cells are major players in immunity, pro- 
viding protection against infection. The most 
common ones are CD4* and CD8* T cells, 
which are found throughout the body and 
carry a broad repertoire of antigen receptors 
on their surface. CD4* and CD8"* T-cell recep- 
tors bind to peptide antigens (fragments of 
proteins) that are ‘displayed’ on the surface of 
other cells by a cell-membrane protein belong- 
ing to the major histocompatibility complex 


NKT cell 


MAIT cells is shown. 
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(MHC) family. Development of these conven- 
tional CD4* and CD8* T cells depends on the 
presence of MHC molecules. 

By contrast, MAIT cells are a type of uncon- 
ventional T cell that are mostly found in the 
intestine, liver and lung* and that have only 
limited antigen-receptor diversity. MAIT-cell 
development depends on an MHC-related 
protein called MRI, which has been highly 
conserved over the course of mammalian 
evolution. Because the amino-acid sequence 
of MR1 is very similar to that of MHC mol- 
ecules, it has been speculated that MR1 binds 
to specific antigens that lead to MAIT-cell 
activation. Indeed, genetic and biochemical 
studies suggest that MR1 presents antigen for 
MAIT-cell activation, but the chemical nature 
of the antigen was unknown’. 

Intriguingly, and uniquely among known 
T-cell populations, MAIT-cell survival also 
depends on the commensal microbiota — the 
non-pathogenic microorganisms that live on 
and in the body. Furthermore, MAIT cells are 
activated by the presence of cells infected with 
a diverse range of bacteria and yeast strains 
(although not viruses)*’. Together, these 
findings hinted that MR1 is likely to bind 
microbial antigens, which are then presented to 
MATT cells. 

Kjer-Nielsen et al. have taken a key step 
in identifying exactly which antigens MR1 
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grooves that can accommodate the lipid chains of the antigen. An example 
ofa glycolipid that stimulates NKT cells is depicted. c, Kjer-Nielsen and 
colleagues’ demonstrate that a third type of antigen-presenting molecule, 
called MR1, presents metabolites of B vitamins to T cells called MAIT cells. 
The authors present a crystal structure of the MR1 molecule bound toa 
derivative of vitamin B9, which shows that the antigen-binding groove 
accommodates the pterin-ring structures characteristic of B vitamins and 
their metabolites. An example of a vitamin-B2 metabolite that stimulates 


presents by defining the crystal structure of 
MRI bound by a metabolite of folic acid called 
6-formy] pterin (6-FP). Their study stemmed 
from the serendipitous observation that using 
media containing folic acid enhanced the 
folding of denatured MR1 protein. 

The authors’ crystal structure shows that 
the MR1 antigen-binding groove specifically 
accommodates pterin rings, which are scaf- 
fold structures contained in some B vitamins 
and their metabolites. They also found that, 
although the 6-FP-MRI complex does not 
activate MAIT cells in vitro, related riboflavin 
derivatives, when bound to MRI, do. This 
is the first demonstration that MR1 binds 
vitamin-B metabolites and that some of these 
metabolites can activate MAIT cells, and it 
therefore defines a new model of antigen pres- 
entation to immune cells. It was already known 
that MHC molecules present peptides to CD4* 
and CD8* T cells, and that another MHC-like 
protein called CD1d presents lipid molecules 
to a class of T cell called NKT cells; now we 
have evidence that MRI presents vitamin-B 
metabolites to MAIT cells (Fig. 1). 

However, exactly how this antigen-pres- 
entation process is linked to immunity to 
microbes is still not fully clear. Kjer-Nielsen 
and colleagues suggest that a mechanism by 
which MAIT cells detect and control infec- 
tion is the display of vitamin-B metabolites on 
the surface of infected host cells. In support 
of this proposal, they cite the previous find- 
ing that the metabolic pathway that generates 
the antigenic molecules seems to be present 
only in microbes previously found to activate 
MATT cells in vitro®’. This correlative observa- 
tion will need to be tested empirically to deter- 
mine the importance of vitamin-B-metabolite 
presentation in controlling infection. 

It will also be interesting to identify the 
cellular location and mechanism by which 
vitamin-B metabolites bind to MR1 proteins, 
and the role of infection in this process. In 
germ-free mice, which do not contain any 
commensal bacteria and therefore also lack 
MAIT cells, the addition of certain commensal 
bacterial strains allows MAIT cells to develop’. 
But, curiously, not all of these commensal 
strains have the molecular pathway that makes 
the metabolites studied by Kjer-Nielsen and 
colleagues, which implies that other MR1 
ligands might be involved in microbial detec- 
tion by MAIT cells. In addition, some cell-sig- 
nalling molecules, such as interleukin-12 and 
interleukin-23, are known to activate MAIT 
cells*”, and this might mitigate the importance 
of activating signals derived from vitamin 
presentation by MRI. 

Another pertinent question is that of 
the roles that MAIT cells have in the gut. 
Vitamins help to orchestrate the relation- 
ships between mammalian host immunity, 
commensal gut microbiota and pathogenic 
microorganisms”’. For example, vitamin B9 
and its derivatives can serve as coenzymes 


in essential metabolic pathways”, and this 
vitamin is also required for the survival of a 
type of T cell called regulatory T cells”. Kjer- 
Nielsen and colleagues’ findings suggest 
that interactions between the host and gut 
microbiota might also be influenced by MR1- 
dependent presentation of microbial antigens 
to MAIT cells. 

A model that emerges from this idea is 
that, during early mammalian development, 
the colonization of the host by commensal 
bacteria’ provides vitamin metabolites that 
act as ligands for MRI, allowing MAIT cells 
to develop in the thymus. These cells then 
migrate to other organs, in particular the 
lungs, liver and gut, where they help to guard 
against bacterial infections. Although further 
work is required, it is attractive to speculate 
that MAIT-cell-dependent protection against 
pathogens could be augmented by dietary 
provision of vitamins or by pterin-based 
therapies. Such augmentation might enhance 
immunity to microbes, or even help to treat 
immunodeficiencies. = 
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Strongly correlated 


transport 


The field-effect transistor underlies microprocessor technology. A version of it 
has been demonstrated that tunes particle transport from an incoherent regime 
to astrongly correlated superfluid one. SEE LETTER P.736 


LINCOLN D. CARR & MARK T. LUSK 


he quantum transport of charge is a 
prominent feature in emerging models 
for future technologies, from micro- 
processors in computing to radical ideas 
for renewable-energy materials. In general, 
transport describes the motion of matter that 
has sufficient energy to overcome barriers, in 
contrast to quantum tunnelling, which occurs 
directly through barriers. Such transport 
becomes ‘quantum’ when the wave-like nature 
of particles is required to describe the process. 
There are different types of quantum trans- 
port, and devices that turn such flows on and 
off are called field-effect transistors (FETs)’ 
and are a mainstay of microprocessor technol- 
ogy. In this issue, Stadler et al.* demonstrate a 
micrometre-scale FET that is able to capture 
the dynamics of quantum transport in a regime 
called strongly correlated quantum-coherent 
superfluidity. Their contribution can best be 
explained by tracking how different FETs can 
be used to study the dynamics of charge ina 
progression of transport regimes. 
The basic concept of a FET is to use an 


electric field (gate) to control the transport of 
electrons or holes (notional particles formed 
by the absence of electrons) through a nar- 
row channel between two charge reservoirs 
(source and drain) in a semiconducting 
material. Quantum mechanics must be used 
to predict the overall behaviour of the collec- 
tion of charges, but the motion of each parti- 
cle amounts to a (biased) random walk. The 
particles’ dynamics is modelled by the semi- 
classical Boltzmann equation. However, under 
certain conditions, the wave-like nature of the 
charges becomes a major feature of transport 
properties. Then, different scattering paths 
can interfere like waves, building up construc- 
tive and destructive interference patterns, for 
instance, and the semi-classical Boltzmann 
equation is no longer correct. 

On the edge of this transport regime, a blend 
of particle and wave-like dynamics is observed. 
Such partially coherent dynamics is currently 
being explored*”, for example to efficiently 
move charges in solar-cell materials and in the 
latest molecular-electronics devices (Fig. 1a,b). 
Deeper into the quantum-coherent transport 
regime, which is facilitated by very cold (near 
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Figure 1 | Quantum transport in field-effect transistors (FETs). Ina solid-state FET, an electrode called a 
gate, which is usually made of gold, generates an electric field that controls charge transport through a narrow 
channel between two reservoirs (source and drain). a, Partially coherent transport (orange) between quantum 
dots underlies a new generation of solar-cell materials**, and quantum-dot FETs could be used to control and 
study the transport dynamics. The dots, which are made of assemblies of atoms (here silicon), are bridged by 
asmall organic molecule and placed on an insulating substrate. The small green atoms connect the molecule 
to the reservoirs. b, A quantum-interference FET (or QuIET) uses the wave-like nature of the electron (not 
shown) in different paths (orange arrows) through a single benzene molecule to control charge flow from 
source to drain’, both of which are made of gold. c, A FET based on a superconductor of electron pairs, 
formed at the interface of two oxides (lanthanum aluminate and strontium titanate), laid on a conducting 
substrate, at near-millikelvin temperatures’ can be used to explore coherent transport and quantum-phase 
transitions. d, Stadler and colleagues’ demonstrate a FET in which a laser beam serves as the device’s gate and 
a magnetic field applied to the FET allows the interactions between ultracold atomic pairs to be tuned. 


millikelvin) temperatures, the charges pair up 
into entities called Cooper pairs, their motion 
becomes weakly correlated (the motion of 
one pair depends on the motion of the oth- 
ers), and charge flows without resistance. This 
is the behaviour that underlies the properties 
of superconductors and their charge-neutral 
analogues, superfluids. A carefully crafted, 
cold version of the common FET has been 
developed for studying coherent-transport 
dynamics by forming such superconductors 
in complex oxide layers cooled in conventional 
dilution refrigerators (Fig. 1c). 

If the strength of the interactions between 
Cooper pairs is extreme, they become 
strongly correlated and the character of the 
system is fundamentally different. This is 
the regime of behaviour studied by Stadler 
and colleagues. The authors’ FET is able to 
capture the dynamics of strongly correlated 
superfluids’. 

Instead of working with charges directly, 
their apparatus is designed to study the motion 
of neutral atoms that pair up to make dia- 
tomic molecules in place of the Cooper pairs 


of electrons in a superconductor. Their quan- 
tum-coherent FET comprises a source and 
drain that contain a gas of fermionic lithium 
atoms (Fig. 1d). Fermions have half-integer 
spin and are exclusive — that is, identical fer- 
mions refuse to be in the same quantum state. 
Stadler et al. trap their fermionic atoms in laser 
and magnetic fields, and at sub-microkelvin, 
ultracold temperatures — 10-million-fold 
colder than the cosmic microwave background 
radiation of outer space. Whereas FETs in 
microprocessors are typically tens of nano- 
metres in size and have charge-carrier veloci- 
ties approaching 10’ centimetres per second, 
Stadler and colleagues’ FET is on the scale of 
tens of micrometres and operates with trans- 
port velocities of between 1 millimetre and a 
few centimetres per second. 

Significantly, the authors’ apparatus allows 
them to tune the attractive atomic interactions 
over many orders of magnitude, so that the 
transport behaviour can be made to transi- 
tion from the weakly correlated dynamics of 
the superconductors in the cold oxide FETs to 
the regime of strongly correlated superfluidity 


682 | NATURE | VOL 491 | 29 NOVEMBER 2012 


© 2012 Macmillan Publishers Limited. All rights reserved 


in ultracold fermionic atoms. Furthermore, 
the relatively large size of the authors’ FET 
allows them to view what is actually happen- 
ing in the transport channel, by means of high- 
resolution in situ optical imaging. The usual 
techniques for directly observing atoms, which 
involve atomic-force microscopy, cannot 
view such dynamics: we cannot zoom in ona 
solid-state FET in action. 

Strongly correlated quantum behaviour has 
also been observed in high-energy nuclear- 
physics experiments*” that create a state of 
matter known asa quark-gluon plasma, which 
is thought to have appeared shortly after the 
Big Bang. It has also been observed in ultra- 
cold quantum gases'*'*. These two systems 
fall into the category of extreme quantum 
matter. Holographic duality’* "* (a technique 

originating from string theory) infers that 
such strongly interacting quantum sys- 
tems are mathematically equivalent to 
weakly curved gravity in one higher spa- 
tial dimension than our usual space-time 
continuum of four dimensions. This extra 
dimension has the physical effect of acting as 
a ‘zoom on a quantum system. Holographic 
duality predicts that these strongly correlated 
systems approach a perfect fluid, having a ratio 
of viscosity to entropy density far lower than 
that of ordinary fluids’. 

Stadler et al. thus offer a continuously tun- 
able FET system that not only provides insight 
into quantum-coherent FET design, but also 
connects solid-state physics and extreme 
quantum matter in the new regime of quan- 
tum-coherent FET operation. = 
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Making sense of palaeoclimate sensitivity 


PALAEOSENS Project Members* 


Many palaeoclimate studies have quantified pre-anthropogenic climate change to calculate climate sensitivity (equi- 
librium temperature change in response to radiative forcing change), but a lack of consistent methodologies produces a 
wide range of estimates and hinders comparability of results. Here we present a stricter approach, to improve 
intercomparison of palaeoclimate sensitivity estimates in a manner compatible with equilibrium projections for future 
climate change. Over the past 65 million years, this reveals a climate sensitivity (in K W~' m?) of 0.3-1.9 or 0.6-1.3 at 95% or 
68% probability, respectively. The latter implies a warming of 2.2-4.8 K per doubling of atmospheric CO,, which agrees 


with IPCC estimates. 


radiation budget requires the definition of climate sensitivity: 

this is the global equilibrium surface temperature response to 
changes in radiative forcing (an alteration to the balance of incoming and 
outgoing energy in the Earth-atmosphere system) caused by a doubling 
of atmospheric CO, concentrations. Despite progress in modelling and 
data acquisition, uncertainties remain regarding the exact value of cli- 
mate sensitivity and its potential variability through time. The range of 
climate sensitivities in climate models used for Intergovernmental Panel 
for Climate Change Assessment Report 4 (IPCC-AR4) is 2.1-4.4K per 
CO, doubling’, or a warming of 0.6-1.2K per Wm™~ of forcing. 
Observational studies have not narrowed this range, and the upper limit 
is particularly difficult to estimate’. 

Large palaeoclimate changes can be used to estimate climate sensi- 
tivity on centennial to multi-millennial timescales, when estimates of 
both global mean temperature and radiative perturbations linked with 
slow components of the climate system (for example, carbon cycle, land 
ice) are available (Fig. 1). Here we evaluate published estimates of climate 
sensitivity from a variety of geological episodes, but find that intercom- 
parison is hindered by differences in the definition of climate sensitivity 


C haracterizing the complex responses of climate to changes in the 


Timescale 
Years Decades Centuries Millennia Multi-millennia // Myr 
> 
Clouds, water vapour, 
lapse rate, snow/sea ice 
Upper ocean 
—>- - -- -- Peewee nnn- > 
CH, CH, (major gas-hydrate feedback; 
for example, PETM) 
Vegetation 
Dust/aerosol Dust (vegetation mediated) 
Entire oceans 
Land ice sheets 
Carbon cycle 
Weathering 
——_> 
Plate tectonics 


Biological evolution 
of vegetation types 


Figure 1 | Typical timescales of different feedbacks relevant to equilibrium 
climate sensitivity, as discussed in this work. Modified and extended from 
previous work”*. Ocean timescales were extended to multi-millennial timescales”. 


between studies (Table 1). There is a clear need for consistent definition 
of which processes are included and excluded in the estimated sensitivity, 
like the need for strict taxonomy in biology. The definition must agree 
as closely as possible with that used in modelling studies of past and 
future climate, while remaining sufficiently pragmatic (operational) to 
be applicable to studies of different climate states in the geological past. 

Here we propose a consistent operational definition for palaeoclimate 
sensitivity and illustrate how a tighter definition narrows the range of 
reported estimates. Consistent intercomparison is crucial to detect sys- 
tematic differences in sensitivity values—for example, due to changing 
continental configurations, different climate background states, and the 
types of radiative perturbations considered. These differences may then 
be evaluated in terms of additional controls on climate sensitivity, such 
as those arising from plate tectonics, weathering cycles, changes in 
ocean circulation, non-CO, greenhouse gases (GHGs), enhanced water- 
vapour and cloud feedbacks under warm climate states. Palaeoclimate 
data allow such investigations across geological episodes with very dif- 
ferent climates, both warmer and colder than today. Clarifying the 
dependence of feedbacks, and therefore climate sensitivity, on the back- 
ground climate state is a top priority, because it is central to the utility of 
past climate sensitivity estimates in assessing the credibility of future 
climate projections’”. 


Quantifying climate sensitivity 

‘Equilibrium climate sensitivity’ is classically defined as the simulated 
global mean surface air temperature increase (AT, in K) in response to a 
doubling of atmospheric CO,, starting from pre-industrial conditions 
(which corresponds to a radiative perturbation, AR, of 3.7 W ms refs 1, 
3). We introduce the more general definition of the ‘climate sensitivity 
parameter’ as the mean surface temperature response to any radiative 
perturbation (S = AT/AR; where AT and AR are centennial to multi- 
millennial averages), which facilitates comparisons between studies 
from different time-slices in Earth history. For brevity, we refer to S as 
‘climate sensitivity’. In the definition of S, an initial perturbation ARo 
leads to a temperature response AT) following the Stefan—Boltzmann 
law, which is the temperature-dependent blackbody radiation response. 
This is often referred to as the Planck response’, with a value Sy of about 
0.3K W | m* for the present-day climate**. The radiative perturbation 
of the climate system is increased (weakened) by various positive (nega- 
tive) feedback processes, which operate at a range of different timescales 
(Fig. 1). Because the net effect of positive feedbacks is found to be greater 
than that of negative feedbacks, the end result is an increased climate 
sensitivity relative to the Planck response’. 


*Lists of participants and their affiliations appear at the end of the paper. 
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Table 1 | Summary of key studies. 


Label in Fig.3 Source Time window Explicitly considered Temperature data used Sand le bounds Notes 
forcings (KW~!m?) 
1 Ref. 2 LG Various Various 0.812027 LGM compilation based 
(data); on ref. 15 
0.81*84, (models) 
2 Ref. 6 LG GHG (CO2, CHa, N20), AT global = —5.8 + 1.4K; 0.728335 Scaling factor (0.85) for 
LI, AE, VG GLAMAP extrapolated : smaller S at LGM compared 
with model®? to 2 x COs (refs 12, 16) 
3 Ref. 86 LG GHG (COs, CHa), CLIMAP and ATaaagia 0.80 + 0.14 Value after authors’ suggested 
LI, AE, VG correction of CLIMAP 
temperatures 
4 Ref. 79 LG GHG (COs, CHa), MARGO®! SST based 0.62 +9.08 Model-based global estimate 
=0.12 
LI, AE, VG AT = —3.0+33K 
5 Ref. 76 GC GHG (COs, CHa) ATtrop 1.1+0.05 Author’s linear regression case. 
Value based on single-site tropical 
SST, and representation of global 
changes will be more uncertain 
6 Ref. 74 GC GHG (COs, CH,), ATaa (with 1.5 x polar 0.88 + 0.13 Author used a single value for 
LI, AE amplification) polar amplification. If 2 were 
used®?, then the central estimate 
is closer to 0.7 
7 Ref. 52 GC GHG (COs, CH,, ATaa (with 2 X polar 0.75 +013 Authors used a single value for 
N20), LI amplification) polar amplification. If 1.5 x were 
used74, then the central estimate 
becomes 1.0 
8 Ref. 52 GC GHG ATaa (with 2 X polar 1,520.25 Authors used a single value for 
(COz, CHa, N20) amplification) polar amplification. If 1.5 x were 
used’”*, then the central estimate 
becomes 2.0 
23-32 This work, GC (<800 kyr ago) GHG (COs, CHy, N20), ATnjH = model-based 0.66 + 0.22 to This covers the range of 
based on ref. 6 LI, AE, VG deconvolution of benthic 2.26+0.78 StcHe,x] given in Table 2 
8180 (ref. 51), scaled to 
global AT using a NH 
polar amplification on 
land of 2.75 + 0.25 
9 Ref. 85 GC GHG (COs, CHy, ATaa (with 2 X polar 0.75 2013 Authors used a single value for 
N20), L amplification) and polar amplification. If 1.5 x were 
1.5 ATas used’, then the central estimate 
becomes 1.0 
10 Ref. 39 GC GHG (COs, CHg, 36-record global SST 0:85+025 Polar amplification diagnosed, 
N20), LI, AE synthesis along with not imposed. Estimates made 
ATaageld: both in a spatially explicit sense 
and as direct global means 
11 Ref. 39 GC GHG (COs, CH,, 36-record global SST 1.05+0.25 As above 
N20), L synthesis along with 
ATaa&gld: 
12 Ref. 87 Early to Middle COz, ESS Using model-based 1.92 +0.14 to Forcing in ref. 44; temperature 
Pliocene (4.2-3.3 AT for Middle and 2.35+0.18 in ref. 87. Both derived in global 
Myr ago) Early Pliocene of (3.3 Myr ago); sense from model experiments 
2.4-2.9°C and 4°C. 2.60 + 0.19 
ACOz alkenone (4.2 Myr ago) 
13 Ref. 65 Miocene optimum Slow feedbacks Deconvolution of 0.78 + 10% f=0.71, B = 5.35, y = 1.3. Details 
to present day benthic 5180 (ref. 63) in Supplementary Information 
14 This work Eocene-Oligocene CO>. ESS (in the Model-based AT, 172807 Details in Supplementary 
(compilation) — transition(~34Myr — sense of ref. 44) with range of CO2 Information 
ago) values 
15 This work Late Eocene CO>. ESS (in the Model-based AT, 1e2e026 Details in Supplementary 
(compilation) versus present sense of ref. 44) with range of CO> values Information 
16 Ref. 78 Middle Eocene CO». Ice-free world. ATys (2 records) and 0.95 + 0.3 500 kyr timescale. ATy, = ATing- 
Climatic Optimum Event study (not ATmg (7 records). Temperatures from subtropics to 
(~40 Myr ago) affected by plate ACOz from alkenones high latitudes; no tropical data. 
tectonics and Hence biased to high-latitude 
evolution effects) sensitivity 
17 Ref. 78 Mid to Late Eocene CObz. Largely ice-free ATys(ref.71) and ATs. 0.95+0.3 Multi-million-year timescale. 
transition (41-35 world. Event study ACO>z = difference mid Adding uncertainty of +1°C 
Myr ago) (not affected by Eocene alkenone and to AT would enhance 1a limits 
plate tectonics and late Eocene 51!B to +0.45 KW! m? 
evolution effects) 
18 Ref. 88 Early Eocene COp. Ice-free world. ATmg (refs 89-91). ACO2 0.65 + 0.25 Central value recalculated in 
(~55-50 Myr ago) (potential influences based on modelling?! ref. 94. 
of plate tectonics and = marine organic carbon Note ref. 89 underestimated 
biological evolution isotope fractionation®? tropical SST 
not considered) and soil nodules?? 
19 This work PETM COz. Ice-free world. ATas (>6 records) and 1.0-1.8 Details in Supplementary 


(compilation) 


(~56 Myr ago) 


Event study (not 
affected by plate 
tectonics and 

evolution effects) 


ATmg (>11 records; 
equatorial to polar). 


ACOz based on deep ocean 


carbonate chemistry”* °° 


Information. Assumes all warming 
due to C input, and range of 
background COz and C-injection 
scenarios. ATys = ATing. Total 
range of Sis 0.7-2.2 KW! m?. 
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Label in Fig.3 Source Time window Explicitly considered Temperature data used Sand le bounds Notes 
forcings (KW! m?) 
20 Ref. 96 Cretaceous and CO>. Largely ice-free 1 Recalculated in ref. 94. No 
early Palaeogene world. (potential uncertainty range was reported, 
influences of plate nor salient details for assessment. 
tectonics and biological Figure 3b, c assumes +25% 
evolution not 
considered) 
21 Ref. 94 Cretaceous and COz. Largely ice-free AT after refs 52, 71. >0.8 No uncertainty range reported. 
early Palaeogene world. ESS in the ACOz based on ref. 60. This is a lower bound estimate 
sense of ref. 44 only 
22 Ref. 97 Phanerozoic COz. Ice-free situation. ATmg, ACO2 based 0.8-1.08 Model-based with extensive 


(Potential influences 
of plate tectonics and 
biological evolution 
not considered). 


on GEOCARBSULF 


uncertainty analysis 


These studies have empirically determined S for the Pleistocene and some deep-time periods from comparison between data-derived time series for temperature and for radiative change. Comparison of results 
between studies is greatly hindered by the different ‘versions’ of S used, as related to different notions of which processes should be explicitly accounted for, and by the different approaches taken to approximate 
global mean surface temperature. All uncertainties are as originally reported, but shown here at the level equivalent to 1c, estimated where necessary by dividing total range values by a factor of 2. All values for Sare 
reported in KW | m?, where necessary after transformation using 3.7 Wm per doubling of COs, bearing in mind the caveats for this at high COz concentrations as elaborated in the main text. GC, glacial cycles; 
LGM, Last Glacial Maximum; PETM, Palaeocene/Eocene thermal maximum; SST, sea surface temperature. See main text for details of forcings. Subscripts: aa, Antarctica; gld, Greenland; trop, tropical; ds, deep 


sea; global, global mean; mg, Mg/Ca; NH, Northern Hemisphere. 


We emphasize that all feedbacks, and thus the calculated climate 
sensitivity, may depend in a—largely unknown—nonlinear manner 
on the state of the system before perturbation (the ‘background climate 
state’) and on the type of forcing’ ’*. The relationship of S with back- 
ground climate state differs among climate models'*'*'*. A suggestion 
of state dependence is also found in a data comparison (Table 2)°, where 
climate sensitivity for the past 800,000 years (800 kyr) shows substantial 
fluctuations through time (Fig. 2). In contrast, its values for the Last 
Glacial Maximum (LGM) alone occupy only the lower half of that range 
(Fig. 2). That evaluation also suggests that the relationship of S with the 
general climate state may not be simple. 


‘Fast’ versus ‘slow’ processes 


Climate sensitivity depends on processes that operate on many different 
timescales, from seconds to millions of years, due to both direct response 
to external radiative forcing, and internal feedback processes (Fig. 1). 
Hence, the timescale over which climate sensitivity is considered is 
critical. An operationally pragmatic decision is needed to categorize a 
process as ‘slow’ or ‘fast’, depending on the timescale of interest, the 
resolution of the (palaeo-)records considered and the character of 
changes therein”. If a process results in temperature changes that reach 
steady state slower than the timescale of the underlying radiative per- 
turbation, then it is considered ‘slow’; ifit is faster or coincident, then it is 
‘fast’. Present-day atmospheric GHG concentrations and the radiative 
perturbation due to anthropogenic emissions increase much faster than 
observed for any natural process within the Cenozoic era*”. 


For the present, the relevant timescale for distinguishing between fast 
and slow processes can be taken as 100 yr (ref. 23). Ocean heat uptake 
plays out over multiple centuries. Combined with further ‘slow’ pro- 
cesses, it causes climate change over the next few decades to centuries to 
be dominated by the so-called ‘transient climate response’ to radiative 
changes that result from changing GHG concentrations and aerosols”!””*. 
After about 100 yr, this transient climate response is thought to amount to 
roughly two-thirds of the equilibrium (see below) climate sensitivity’”’. 
Climate models account for the fast feedbacks from changes in water- 
vapour content, lapse rate, cloud cover, snow and sea-ice albedo”, and 
the resulting response is often referred to as the ‘fast-feedback’ or 
‘Charney’ sensitivity’. To approximate the ‘equilibrium’ value of that 
climate sensitivity, accounting for ocean heat uptake and further slow 
processes, models might be run over centuries with all the associated 
computational difficulties”, or alternative approaches may be used 
that exploit the energy balance of the system for known forcing or 
extrapolation to equilibrium”’. 

In palaeoclimate studies, an operational distinction has emerged to 
distinguish ‘fast’ and ‘slow’ processes relative to the timescales of tempera- 
ture responses measured in palaeodata, where ‘fast’ is taken to apply to 
processes up to centennial scales, and ‘slow’ to processes with timescales 
close to millennial or longer. Thus, changes in natural GHG concentrations 
are dominated by ‘slow’ feedbacks related to global biogeochemical cycles 
(Fig. 1). Similarly slow are the radiative influences of albedo feedbacks that 
are dominated by centennial-scale or longer changes in global vegetation 
cover and global ice area/volume (continental ice sheets) (Fig. 1). 


Table 2 | Common permutations of S that may be encountered in palaeostudies 


Labelin S definition Explicitly considered Period in which it is practical S + lo for 800 kyr Stlo Ss 

Fig. 3 radiative perturbation to use the definition (KW! m?) for LGM (KW~! m?) for Pliocene (K W~! m?) 

23 Stco2] ARtco2] All (especially pre-35 Myr ago 3.08 + 0.96 2.63 = 0:57 12 

when LI ~ 0) 

24 Stco2, uy AR\cog, Ly <35 Myr ago 1.07 + 0.40 0.95 + 0.22 0.97 

25 Stco2, ul, vel AR{co2, Li, vel <35 Myr ago 0.86 + 0.27 0.80 + 0.19 0.82 

26 Stco2, Li, AE] ARtcoa, Ll, AE] <35 Myr ago, but mainly <800 kyr ago 0.90 + 0.42 0.72 +0.18 

27 Sico2, Li, AE, VG] ARtcoa, Ll, AE, VG] <35 Myr ago, but mainly <800 kyr ago 0.75 +0.29 0.63 +0.15 

28 Sicha] ARtcHc] <800 kyr ago 2.32 +0.76 1.97+041 

29 Sige, Ly ARtcue, Uy] <800 kyr ago 0.96 + 0.36 0.85 + 0.19 

30 SIGH, LI, Val ARr@ne, uy, ve] <800 kyr ago 0.78 + 0.23 0.73 + 0.16 

31 SIGH, LI, AE] AR(GHG, LI, AE] <800 kyr ago 0.82 + 0.36 0.66 + 0.16 

32 Sisic thaevei. - AR ice, ak vel <800 kyr ago 0.68 + 0.24 0.58+0.14 
S (second column) is presented with a subscript that identifies the explicitly considered radiative perturbations AR (third column, same subscripts as for S); all other processes are implicitly resolved as feedbacks 
within S. The period in which the various definitions of S are practical is determined by the availability of data for the explicitly considered processes. Subscript CO2 indicates the radiative impact of atmospheric 
COz concentration changes; LI represents the radiative impact of global land ice-volume changes; VG stands for the radiative impact of global vegetation cover changes; AE indicates the radiative impact of aerosol 


changes; GHG stands for the impact of changes in all non-water natural greenhouse gases (notably CO2, CH4 and N20). Columns 5 and 6 give calculated values for all suggested permutations of S for the past 
800 kyr or the LGM, respectively, based on a previous data compilation®. Mean values of all Spq for the LGM are about 13% smaller than for the whole 800 kyr, but lie well within the given uncertainties. This offset 
illustrates the state-dependence of S (see Supplementary Information). Column 7 gives examples for the Pliocene’*“*; Fig. 3b, c assumes +25% uncertainty in these. In these values the effects of orographic 


changes have been taken into account (see Supplementary Information section B2). 
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Figure 2 | Illustration of variability of climate sensitivity using a calculation 
of Sjco2,11 as defined in this work, for the past 800 kyr. a, Changes in global 
temperature. b, Changes in radiative forcing due to changes in CO, and surface 
albedo due to land ice. ¢, Calculated Stco2,19, which is only considered robust 
and calculated when AT < —1.5 K and ARjco2,11 < —0.5 W m’”, as indicated 
by the dotted red lines in a and b. In c, mean of S; + oo (dashed black lines 
indicate 9, the uncertainty of averaging) and 100-kyr running mean (blue line) 
are shown. Magenta marker in c denotes S; + o, for the LGM only (23-19 kyr 
ago) (a; is the square root of the sum of squares of individual uncertainties 
connected with different processes contributing to S;). The grey areas in 

a-c denote o, (standard deviation) uncertainties of S; for single points in time 
(points themselves are omitted for clarity). Details of data and the definition of 
the calculated uncertainties presented in this figure are available in 
Supplementary Information. In a and b, the dashed black lines indicate the 
preindustrial reference case (AT = 0 K, AR{co2,11) = 0 W m”). 


Other processes clearly have both fast and slow components. For 
example, palaeorecords of atmospheric dust deposition imply important 
aerosol variations on decadal to astronomical (orbital) timescales*”*°, 
reflecting both slow controlling processes related to ice-volume and 
land-surface changes, and fast processes related to changes in atmo- 
spheric circulation. A further complication arises from the lack of ade- 
quate global atmospheric dust data for any geological episode except the 
LGM’”**, even though that is essential because the spatial distribu- 
tion of dust in the atmosphere tends to be inhomogeneous and because 
temporal variations in some locations tend to take place over several orders 
of magnitude****. Moreover, palaeoclimate models generally struggle to 
account for aerosols, with experiments neither prescribing nor implicitly 
resolving aerosol influences. So far, understanding of aerosol/dust feed- 
backs remains weak and in need of improvements in both data coverage 
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and process modelling, especially because dust forcing may account for 
some 20% of the glacial-interglacial change in the radiative budget®”. 

So for comparison of results between studies, it is most effective to 
consider only the classical ‘Charney’ water-vapour, cloud, lapse rate, and 
snow and sea-ice feedbacks”® as ‘fast’, and all other feedbacks as ‘slow’. In 
addition, results from palaeoclimate sensitivity studies generally do not 
address the transient climate response that dominates present-day 
changes, but capture a more complete longer-term system response 
comparable with equilibrium climate sensitivity in climate models. 


Forcing and slow feedbacks 


The external drivers of past natural climate changes mainly resulted 
from changes in solar luminosity over time*’, from temporal and spatial 
variations in insolation due to changes in astronomical parameters**”, 
from changes in continental configurations'*”, and from geological 
processes that directly affect the carbon cycle (for example, volcanic 
outgassing). However, the complete Earth system response to such forcings 
as recorded by palaeodata cannot be immediately deduced from the 
(equilibrium) ‘fast feedback’ sensitivity of climate models, because of 
the inclusion of slow feedback contributions. When estimating climate 
sensitivity from palaeodata, agreement is therefore needed about which 
of the slower feedback processes are viewed as feedbacks (implicitly 
accounted for in S), and which are best considered as radiative forcings 
(explicitly accounted for in AR). 

We employ an operational distinction*'*° in which a process is con- 
sidered as a radiative forcing if its radiative influence is not changing 
with temperature on the timescale considered, and as a feedback if its 
impact on the radiation balance is affected by temperature changes on 
that timescale. For example, the radiative impacts of GHG changes over 
the past 800 kyr may be derived from concentration measurements of 
CO, CH, and NO in ice cores**’, and the radiative impacts of land-ice 
albedo changes may be calculated from continental ice-sheet estimates, 
mainly based on sea-level records*”*'. Thus, the impacts of these slow 
feedbacks can be explicitly accounted for before climate sensitivity is 
calculated. This leaves only fast feedbacks to be considered implicitly in 
the calculated climate sensitivity, which so approximates the (equilib- 
rium) ‘Charney’ sensitivity from modelling studies*’”*”. 
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Operational challenges 


All palaeoclimate sensitivity studies are affected by limitations of data 
availability. Below we discuss such limitations to reconstructions of 
forcings and feedbacks, and of global surface temperature responses. 
First, however, we re-iterate a critical caveat, namely that the climate 
response depends to some degree on the type of forcing (for example, 
shortwave versus longwave, surface versus top-of-atmosphere, and local 
versus global). The various radiative forcings with similar absolute mag- 
nitudes have different spatial distributions and physics, so that the con- 
cept of global mean radiative forcing is a simplification that introduces 
some (difficult to quantify) uncertainty. 

Astronomical (orbital) forcing is a key driver of climate change. In 
global annual mean calculations of radiative change, astronomical forcing 
is very small and often ignored*”*’. Although this obscures its importance, 
mainly concerning seasonal changes in the spatial distribution of inso- 
lation over the planet*'’*°*°°, we propose that the contribution of the 
astronomical forcing to AR may be neglected initially. When other com- 
ponents of the system respond to the seasonal aspects of forcing, such as 
Quaternary ice-sheet variations, these may be accounted for as forcings 
themselves. 

GHG concentrations from ice cores are not available for times before 
800 kyr ago, when CO, levels instead have to be estimated from indirect 
methods. These employ physico-chemical or biological processes that 
depend on CO, concentrations, such as the abundance of stomata on 
fossil leaves”, fractionation of stable carbon isotopes by marine phyto- 
plankton”’, boron speciation and isotopic fractionation in sea water as a 
function of pH and preserved in biogenic calcite**, and the stability fields 
of minerals precipitated from waters in contact with the atmosphere”. 
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Considerable uncertainties remain in such reconstructions, but improve- 
ments are continually made to the methods, their temporal coverage and 
their mutual consistency®’. Recent work has synthesized a high-resolu- 
tion CO, record for the past 20 million years (Myr; ref. 61), but new data 
and updated syntheses remain essential, particularly for warmer climate 
states. Also, proxies are needed for reconstruction of CH, and NO con- 
centrations in periods pre-dating the ice-core records”. 

Regarding the assessment of land-ice albedo changes, good methods 
exist for the generation of continuous centennial- to millennial-scale 
sea-level (ice-volume) records over the past 500 kyr (refs 49-51), but 
such detailed information remains scarce for older periods. A model- 
based deconvolution of deep-sea stable oxygen isotope records into 
their ice-volume and deep-sea temperature components”! was recently 
extended to 35 Myr ago”, but urgently requires independent validation, 
especially to address uncertainties about the volume-to-area relation- 
ships that would be different for incipient ice sheets than for mature 
ice sheets™®°. Before 35 Myr ago, there is thought to have been (virtually) 
no significant land-ice volume”, but this does not exclude the potential 
existence of major semi-permanent snow/ice-fields”*, and there remain 
questions whether these would constitute ‘fast’ (snow) or ‘slow’ (land-ice) 
feedbacks. The contribution of the sea-ice albedo feedback also remains 
uncertain, with little quantitative information beyond the LGM. 

Similar examples of uncertainties and limited data availability could 
be listed for all feedbacks. However, a ‘deep-time’ (before 1 Myr ago) 
geological perspective must be maintained because it offers access to the 
nearest natural approximations of the current rate and magnitude of 
GHG emissions’, and because only ancient records provide insight 
into climate states globally warmer than the present. Given that no past 
perturbation will ever present a perfect analogue for the continuing 
anthropogenic perturbation, it may be more useful to consider past 
warm climate states as test-beds for evaluating processes and responses, 
and for challenging/validating model simulations of those past climate 
states. Such data~model comparisons will drive model skill and under- 
standing of processes, improving confidence in future multi-century 
projections. For such an approach, palaeostudies may minimize the 
impacts of very long-term influences on climate sensitivity (for example, 
due to changes in orography, or biological evolution of vegetation) 
through a focus on highly resolved documentation of specific perturba- 
tions that are superimposed upon different long-term background climate 
states. An example is the pronounced transient global warming and 
carbon-cycle perturbation during the Palaeocene/Eocene thermal maxi- 
mum (PETM) anomaly’’”’, which punctuated an already warm climate 
state’. Note that deep-time case studies need to consider one further 
complication, namely that the radiative forcing per CO2 doubling may 
be about 3.7 Wm~ when starting from pre-industrial concentrations, 
but increases at higher CO, levels'’. Data-led studies may help with a 
first-order documentation of this dependence. Calculation of S from CO 
and temperature measurements using a constant 3.7Wm”~ per CO; 
doubling would (knowingly) overestimate S for high-CO, episodes. 
The difference with other, identically defined, S values for different cli- 
mate background states may then be used to assess any deviation from 
3.7 W m~ per CO, doubling. 

Regarding the reconstruction of past global surface temperature res- 
ponses (that is, AT in equation (1) below), again much remains to be 
improved. Most work to date (see Table 1) relies on one or more of the 
following: polar temperature variations from Antarctic ice cores (since 
800 kyr ago) with a multiplicative correction for ‘polar amplification’ 
(usually estimated at 1.5-2.0; refs 74, 75); deep-sea temperature varia- 
tions from marine sediment-core data with a correction for the ratio 
between global surface temperature and deep-sea temperature changes 
(often estimated at 1.5); single-site sea surface temperature (SST) records 
from marine sediment cores; or compilations of SST data of varying 
geographic coverage from marine sediment cores*”**’°”*. So far, few 
studies have included terrestrial temperature proxy records other than 
those from ice cores”, yet better control on land-surface data is crucial 
because of seasonal and land-sea contrasts. Continued development is 
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needed of independently validated (multi-proxy) and spatially represent- 
ative (global) data sets of high temporal resolution relative to the climate 
perturbations studied. 

Uncertainties in individual reconstructions of temperature change 
may in exceptional cases be reported to +0.5 K, but more comprehensive 
uncertainty assessments normally find them to be larger***'. Compilation 
of such records to determine changes in global mean surface tempera- 
ture involves the propagation of further assumptions/uncertainties, for 
example due to interpolation from limited spatial coverage, and the end 
result is unlikely to be constrained within narrower limits than +1 K even 
for well-studied intervals. Finally, comparisons between independent 
reconstructions for the same episode reveal ‘hidden’ uncertainties due 
to differences between each study’s methodological choices, uncertainty 
determination, and data-quality criteria, which are hard to quantify and 
often poorly elucidated. Take the LGM for example, which for tempera- 
ture is among the best-studied intervals. The MARGO compilation*! 
inferred a global SST reduction of -1.9 + 1.8K relative to the present. 
Another spatially explicit study” used that range to infer a global mean 
surface air temperature anomaly of —3*}3K. The latter contrasts with a 
previous estimate of —5.8 + 1.4K (ref. 82), which is consistent with 
tropical (30°S to 30°N) SST anomalies of —2.7+ 1.4K (ref. 83). 
However, that tropical range itself is also contested; the MARGO®! 
study suggested such cooling in the Atlantic Ocean, but less in the 
tropics of the Indian and Pacific Oceans (giving a global tropical 
cooling of only —1.7 + 1.0K). Clearly, even a well-studied interval 
gives rise to a range of estimates for temperature, and therefore for 
climate sensitivity. 

It is evident that progress in quantifying palaeoclimate sensitivity 
will not only rely on a common concept and terminology that allows 
like-for-like comparisons (see below); it will also rely on an objective, 
transparent and hence reproducible discussion in each study of the 
assumptions and uncertainties that affect the values determined for 
change in both temperature and radiative forcing. 


A way forward 


Here we propose a new terminology to help palaeoclimate sensitivity stud- 
ies adopt common concepts and approaches, and thus improve the poten- 
tial for like-for-like comparisons between studies. First we outline how our 
concept of ‘equilibrium’ S for palaeo-studies relates to ‘equilibrium’ S for 
modern studies. Then, we present a notation system that is primarily of 
value to data-based palaeo studies to clarify which slow feedbacks are 
explicitly accounted for. We finish with an application of the new frame- 
work, calculating climate sensitivity from a representative selection of 
palaeoclimate sensitivity estimates over the past 65 Myr, with a fair balance 
of climates warmer than the present to those colder than the present. 

When the AT response to an applied GHG radiative forcing AR is 
small relative to ‘pre-perturbation’ reference temperature, the ‘equilib- 
rium’ climate sensitivity S° (where a indicates actuo, for present-day) 
takes the form (see, for example, refs 4, 84): 

a Ag 2. = (1) 
AR Nf 
Ap + yy 4; 
i=1 

Here Ap is the Planck feedback parameter (—3.2 W m ?K~')and J}; 
(in Wm *K_') represents the feedback parameters of any number 
(N) of fast (f) feedbacks. We define feedback parameters in the form 
i, = AR/AT. S* is the ‘Charney’ sensitivity calculated by most cli- 
mate models in ‘2 X CO,’ equilibrium simulations, with a range of 
0.6-1.2 KW‘ m? in IPCC-AR4. However, the Earth system in reality 
responds to a perturbation according to an equilibrium climate sen- 
sitivity parameter S? (where p indicates palaeo), but the timescales to 
reach this equilibrium are long, so that the forcing normally changes 
before equilibrium is reached. To obtain S* from palaeoclimate sens- 
itivity S?, a correction is therefore needed for the slow feedback influ- 
ences. Using x; to represent any number (M) of slow (s) feedbacks, we 
derive the general expression (see Supplementary Information): 


29 NOVEMBER 2012 | VOL 491 | NATURE | 687 


©2012 Macmillan Publishers Limited. All rights reserved 


| RESEARCH | PERSPECTIVE 


(2) 


This approach is contingent on the above-mentioned caveats of state- 
dependence, linearization (small AT), changes in slow feedbacks, and 
transient effects, where the last is relevant only in records of excep- 
tionally high temporal resolution. Knowledge of slow (4°) and fast (2%) 
feedbacks can be combined into a factor F = A$/(A' + 2S) that may 
then be used to back-calculate fast feedbacks out of palaeoclimate 
sensitivity S?. 

A recent study“ defined the term ‘Earth system sensitivity’ (ESS) to 
represent the long-term climate response of Earth’s climate system to a 
given CO, forcing, including both fast and slow processes. In our nota- 
tion, ESS = AR» co2 $?, where AR2xco2 is the forcing due to a CO2- 
doubling (3.7 Wm”). 

Here we introduce a more explicit notation regarding what was (not) 
included in the climate sensitivity diagnosis. It is the “specific climate 
sensitivity’ S;a.,..., expressed in KW '|m/’, where slow feedback pro- 
cesses A, B, and so on, are explicitly accounted for (that is, included in 
the forcing term, AR; ,,,..}). We use ‘LI to denote albedo forcing due to 
land-ice volume/area changes, “VG’ for vegetation-albedo forcing, ‘AE’ 
for aerosol forcing and ‘CO2’ for atmospheric CO, forcing (see also 
Table 1). This approach requires from the outset that a comprehensive 
view is taken of the various causes of change in the radiative balance. 


The most practical version of S to be estimated from palaeodata 
is Stco2,11 because Sico2,11 = Stco2) during times (pre-35 Myr ago) 
without ice volume, and because global vegetation cover changes, atmo- 
spheric dust fluctuations, and both CH, and N,O fluctuations (the two 
important non-CO, GHGs) generally remain poorly constrained by 
proxy data. Common reporting of S{co2,11; would bring results closer 
in line with the model-based concept of ‘equilibrium’ fast-feedback 
sensitivity. The above-mentioned issues with aerosol influences mean 
that it would currently be best for estimates from palaeodata to report 
both Stco2,11 and S[co2,L1,a}- 

Table 2 lists example estimates for S following the main potential 
permutations of the definition of S in our approach (for detailed break- 
downs, see Supplementary Information). The first example uses records 
of palaeodata since 800 kyr ago. The second example uses the same input 
data series®, but focuses only on the LGM; the contrast between exam- 
ples one and two thus highlights state-dependence. The third example 
lists estimates for Sjco2}, Stco2,11) and S[co2,L1,vq] from a more model- 
based assessment for the mid-Pliocene (~3-3.3Myr ago)'*, with 
AT = 3.3K relative to the present and AR¢o2 = 1.9W m™ due to CO, 
increase from 280 to 400 parts per million by volume (p.p.m.v,; ref. 44). 
The broad range of S values found within each example illustrates that 
comparison across different definitions unrealistically widens the range 
of values reported, notably towards high values, because omission of 
‘forcing’ due to the action of any slow feedbacks will cause overestima- 
tion of S (see also Fig. 3). 

For a first-order estimate of the range of S from palaeodata that 
approximates compatibility with the centennial timescale ‘equilibrium’ 
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Figure 3 | Evaluation of results from Tables 1 and 2. y-Axis labels refer to 
numbered rows in these Tables. a, Data summary by table row. b, Probability 
assessment using normal distributions (shifted where relevant). c, Probability 
assessment using lognormal distributions. S,x; refers to the climate sensitivity 
as defined in detail by the subscript X in Tables 1 and 2. For b and c, we assume a 
relative uncertainty of 25% for entries that lacked uncertainty estimates in the 
source studies. In a, rows from Table 2 are identified with either ‘plei’ or ‘plio’ to 
distinguish between the past 800 kyr and the Pliocene entries, respectively. The 
colour coding refers to broad geological intervals, as shown in the key. Boxes at 
right indicate which conditions were explicitly accounted for; that is, as 
‘forcings’ (in the CO,/GHG column, filled squares indicate GHG and open 
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squares CO,). Circles (data points in a) show central values where reported, 
error bars represent uncertainties as outlined in the Tables, at the 1o equivalent 
level. Arrow (case 21) indicates a value reported only as >0.8 KW! m’. Black 
dashed lines in a show 68% probability limits for all estimates that account for at 
least “CO2’ and ‘LI’, based on thick dashed lines in b and c, taking whichever 
68% value offers the widest (more conservatively estimated) margin. In b and 
c, the solid black line indicates the mode value (maximum), and the thin dashed 
lines the 95% probability limits. All distributions in b and ¢ are given as 
individual normalized frequencies (grey lines), and as mean normalized 
frequencies (red line). 
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Figure 4 | Equilibrium response of the global temperature as a function of 
CO, concentrations, based on three different approaches. a, This work, 

using data from the late Pleistocene of the past 800 kyr (ref. 6). b, Using data of 
the past 20 Myr (RW_11; ref. 61). c, Based on JH_12 (ref. 85) using similar data 
of the past 800 kyr as in a. d, Combination of all three approaches. Plotted areas 
include uncertainty estimates of one standard deviation. Because this work and 
JH_12 developed their approach only on Pleistocene data (climate being mainly 
colder than today), extrapolation of the impact of slow feedbacks to 2 * CO} is 


values of the IPCC-AR4', values need to be used that account for ‘CO,’ 
or ‘GHG as well as ‘LI’, and preferably also ‘AE’ and/or ‘VG’ (Tables 1, 2; 
Fig. 3). Such an assessment, excluding the case of row 21 in Table 1, 
yields a likely’ (68%) probability range of 0.6-1.3KW 'm’, and a 
95% range of 0.3-1.9KW 'm* (Fig. 3). These represent the widest 
margins out of two assessments, using either normal distributions 
with shifts when relevant (Fig. 3a), or lognormal distributions that in- 
herently allow asymmetry’ (Fig. 3b). These assessments include uncer- 
tainties as outlined in the source studies, as well as any unaccounted-for 
dependence on different background climate states, but exclude poten- 
tial additional uncertainties highlighted in this study. Inclusion of 
ESS values (approximated by Sjco2}) would extend the upper limit 
beyond 3K W | m’ (Fig. 3a). Future work following a strict framework 
for reporting and comparison of palaeodata may refine the observed 
asymmetry. 

Finally, following our conceptual framework, we can make a projection 
of equilibrium temperature change over a range of CO, concentrations 
while considering either slow and fast (or only fast) feedbacks (Fig. 4; see 
Supplementary Information for details). Including the known uncer- 
tainties associated with palaeoclimate sensitivity calculations, and com- 
paring with two previous approaches®'**, we find overlap in the 68% 
probability envelopes that implies equilibrium warming of 3.1-3.7K 
for 2 X CO, (Fig. 4), equivalent to a fast feedback (Charney) climate 
sensitivity between 0.8 and 1.0K W_ ' m”. For longer, multi-centennial 
projections, some of the slow feedbacks (namely vegetation-albedo and 
aerosol feedbacks) may need further consideration. However, their impact 
is difficult to estimate from palaeodata, because uncertainties are large, 
and because responses during climates colder than present may differ 
from responses during future warming. 


550 


200 250 300 350 400 450 500 550 
CO, (p.p.m.v.) 


not meaningful (we show only extrapolation with fast feedbacks). RW_11 in 
contrast also includes warmer climates with CO, up to 450 p.p.m.v., so that the 
applicable range with slow feedbacks extends to 450 p.p.m.v. For future climate 
with 2 X CO, and a short time horizon (<100 yr), only fast feedbacks are of 
interest (see d). Approaches partly disagree because of different assumptions. 
Uncertainties in this work (a) are estimated to be larger than they were in 
RW_11 (b) and JH_12 (c). For details of the equations and values used, see 
Supplementary Information. 


We have employed a new framework of definitions for palaeoclimate 
sensitivity. This reveals how a broad selection of previously published 
estimates for the past 65 Myr agrees on a best general estimate of 
0.6-1.3 KW! m’, which agrees with IPCC-AR4 estimates for equilib- 
rium climate sensitivity’. Higher estimates than ours may suggest different 
climate sensitivities during particular periods, but a considerable portion of 
the higher values may simply reflect differences in the definitions of 
palaeoclimate sensitivity that were used. 
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The mystery of recent stratospheric 


temperature tr ends 


David W.J. Thompson!, Dian J. Seidel’, William J. Randel’, Cheng-Zhi Zou’, Amy H. Butler’, Carl Mears®, Albert Osso’, Craig Long? 


& Roger Lin® 


A new data set of middle- and upper-stratospheric temperatures based on reprocessing of satellite radiances provides a 
view of stratospheric climate change during the period 1979-2005 that is strikingly different from that provided by 
earlier data sets. The new data call into question our understanding of observed stratospheric temperature trends and our 
ability to test simulations of the stratospheric response to emissions of greenhouse gases and ozone-depleting 
substances. Here we highlight the important issues raised by the new data and suggest how the climate science 


community can resolve them. 


stances and greenhouse gases have driven marked atmospheric 

cooling at stratospheric altitudes’. Ozone depletion is believed to 
have caused the preponderance of the cooling in the lower stratosphere 
(around 15-25 km altitude); both ozone depletion and increases in green- 
house gases are believed to have driven the cooling in the middle and 
upper stratosphere (around 25-50 km altitude)’. Stratospheric temper- 
ature trends play an important part in allowing us to distinguish between 
the climate responses to natural and anthropogenic climate forcings’. 
Although less widely discussed in either scientific or policy circles, stra- 
tospheric cooling is as fundamental as surface warming as evidence of the 
influence of anthropogenic emissions on the climate system. 

Unfortunately, observations of stratospheric temperatures are lim- 
ited. The surface temperature record extends for over a century and is 
derived from multiple data sources’. In contrast, the stratospheric tem- 
perature record spans only a few decades and is derived from a handful 
of data sources**. Radiosonde (weather balloon) measurements are 
available in the lower stratosphere but do not extend to the middle 
and upper stratosphere*®. Lidar (light detection and ranging) measure- 
ments extend to the middle and upper stratosphere but have very limited 
spatial and temporal sampling*’. By far the most abundant observations 
of long-term stratospheric temperatures are derived from satellite mea- 
surements of long-wave radiation emitted by Earth’s atmosphere. 

The longest-running records of remotely sensed stratospheric tempera- 
tures are provided by the Microwave Sounding Unit (MSU), the Advanced 
Microwave Sounding Unit (AMSU), and the Stratospheric Sounding Unit 
(SSU). The SSU and MSU instruments were flown onboard a consecutive 
series of seven NOAA polar-orbiting satellites that partially overlap in time 
from late 1978 to 2006; the AMSU instruments have been flown onboard 
NOAA satellites from mid-1998 to the present day’. 

The MSU, AMSU and SSU temperature measurements do not repre- 
sent temperatures at discrete height levels, but rather are representative of 
temperatures averaged over a continuum of altitudes described by the 
appropriate instrument ‘weighting functions’ (see, for example, Figure 2 
in ref. 4). The weighting function for the highest available MSU channel 
(MSU channel 4) peaks in the lower stratosphere near 20 km altitude. The 
weighting functions for the SSU instrument peak in the middle and upper 
stratosphere at 25-35 km (SSU channel 1), 35-45 km (SSU channel 2), 
and 40-50 km (SSU channel 3). 


ili he radiative effects of human emissions of ozone-depleting sub- 


Continuous time series of lower-stratospheric temperatures are 
derived by combining measurements from satellites that carried MSU 
instruments from 1978-2005 and AMSU instruments from 1998 to the 
present®. The lower-stratospheric MSU and AMSU data have been pro- 
cessed and combined by three different research groups: Remote Sensing 
Systems (RSS)"°, the University of Alabama-Hunstville (UAH)"', and the 
NOAA Center for Satellite Applications and Research (STAR). The 
processing methodologies and resulting lower-stratospheric temperature 
data have been published extensively in the peer-reviewed literature**. 

Global-mean lower-stratospheric temperatures derived from the 
three primary stratospheric MSU products are very similar to each other 
(red, purple and green lines in Fig. 1d (the red, purple and green lines in 
Fig. 1d are reproduced in Fig. 1h to facilitate comparison with model 
simulations, as discussed below); the large but short-lived warmings starting 
in 1982 and 1991 are due to the volcanic eruptions of E] Chichon and Mount 
Pinatubo, respectively). They are also very similar to lower-stratospheric 
temperatures estimated from radiosonde data**. The differences among 
the three MSU lower-stratospheric global-mean temperature time series 
are larger than those associated with separate estimates of global-mean 
surface temperatures’. And yet the differences between the MSU time series 
pale in comparison with those associated with the primary SSU products, as 
demonstrated below. 


The mystery 

Conflicting evidence 

Continuous time series of temperatures in the middle and upper stra- 
tosphere back to 1979 are based exclusively on SSU data (the AMSU data 
also sample the middle and upper stratosphere but are available only 
since 1998). The SSU data require correction for several unique issues 
before they can be used for climate studies (see discussion in ref. 4). For 
example, (1) the SSU instrument relies critically on a cell pressure modu- 
lator of carbon dioxide to determine the emission of stratospheric radi- 
ation from different altitudes. The cells in all SSU instruments leaked with 
time, causing changes in the altitudes being measured; (2) the amplitude 
of the atmospheric thermal tides—and thus the tidal corrections between 
successive satellite missions—is relatively large in the middle and upper 
stratosphere; (3) long-term increases in atmospheric carbon dioxide 
influence the weighting function of the instrument; and (4) there is no 
overlap period between several pairs of consecutive satellites. 


1Department of Atmospheric Science, Colorado State University, Fort Collins, Colorado 80523, USA. 7NOAA Air Resources Laboratory, College Park, Maryland 20740, USA. 2Atmospheric Chemistry 
Division, NCAR, Boulder, Colorado 80307, USA. *NOAA/NESDIS/Center for Satellite Applications and Research , College Park, Maryland 20740, USA. SNOAA/NWS/NCEP/Climate Prediction Center, 
College Park, Maryland 20740, USA. "Remote Sensing Systems, Santa Rosa, California 95401, USA. ’Department of Astronomy and Meteorology, University of Barcelona, Barcelona 08028, Spain. 
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Observations and CMIP5 
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Figure 1 | Global-mean stratospheric temperature anomalies since 1979. 
Time series of monthly mean, global-mean stratospheric temperature 
anomalies are shown for the altitude ranges, data sets and model output 
indicated. Red, blue, green and purple lines indicate results based on 
observations (observations are reproduced in the left and right panels). 

a-h, Grey lines indicate results from the coupled CCM runs available through 
the CCMVal2 archive (a—d) and from the AOGCM runs available through the 


The SSU data were originally processed for climate analysis by scien- 
tists at the UK Met Office in the 1980s'*"*. The data were further revised 
in 2008 to account for variations in the satellite weighting functions over 
time due to changes in atmospheric composition’*. However, the metho- 
dology used to develop the Met Office SSU product was never published 
in the peer-reviewed literature, and certain aspects of the original pro- 
cessing remain unknown. For this reason, the NOAA STAR recently 
reprocessed the SSU temperatures and published the full processing 
methodology and the resulting data in the peer-reviewed literature’®. 

The new NOAA SSU data provide an invaluable independent resource 
for assessing the reproducibility of the original Met Office SSU data. But 
the new data raise more questions than they answer, because they provide 
a strikingly different view of recent stratospheric temperature trends 
(compare the red and blue lines in Fig. la—c; the red and blue lines in 
Fig. la-c are reproduced in Fig. 1e-g to facilitate comparison with model 
simulations, as discussed below). The long-term variability and trends in 
global-mean temperatures for the uppermost SSU channel (SSU channel 
3) are relatively similar in both the Met Office and NOAA data sets. But 
the same cannot be said for the SSU channels that sample the middle 
stratosphere (SSU channels 1 and 2). The global-mean cooling in chan- 
nels 1 and 2 (around 25-45 km) is nearly twice as large in the NOAA data 
set as it is in the Met Office data set (Figs 1 and 2)'°. The differences 
between the NOAA and Met Office channels 1 and 2 global-mean time 
series do not arise from a discrete period of time, but rather increase from 
about 1985 to the end of the record"®. The differences between the NOAA 
and Met Office global-mean time series shown in Fig. 1 are so large they 
call into question our fundamental understanding of observed temper- 
ature trends in the middle and upper stratosphere. 


CMIP5 archive (e-h). Model runs are listed in Table 1 and were converted to 
SSU and MSU time series using the appropriate instrument weighting 
functions”"’. Time series are plotted so that their 1979-1982 mean anomalies 
are zero. Note that several CMIP5 models have poor vertical resolution at 
middle and upper stratospheric altitudes. For this reason, more model 
simulations are available at lower than at upper stratospheric levels 

(see Table 1). 


Disconnects between observations and models 

The story is further muddled when the observations are compared with 
attempts to simulate the past few decades of stratospheric climate change 
using climate models. Two classes of climate models commonly used in 
simulations of past climate are coupled chemistry—climate models (CCMs) 
and coupled atmosphere-ocean global climate models (AOGCMs). By 
definition, the CCMs explicitly simulate stratospheric chemical pro- 
cesses, whereas the AOGCMs explicitly simulate coupled atmosphere- 
ocean interactions. In principle, a coupled chemistry-climate model 
might also simulate coupled atmosphere-ocean interactions, and vice 
versa. But owing to computational limitations, most current CCMs are 
not AOGCMs, and vice versa. A key distinction between the model 
classes that is pertinent to this discussion is that in general the CCMs 
resolve the stratosphere more fully than do the AOGCMs. 

Simulations from CCMs forced with the time history of anthropogenic 
emissions are available via the CCM validation activity (Figs la-d and 
2a-d, results are from the CCMVal2 project; see Table 1 and ref. 17). 
Between 40 and 50 km (channel 3), global-mean temperature trends from 
both SSU products show more cooling than is simulated by the CCMs 
(Figs 1a and 2a; the model temperatures are weighted by the appropriate 
satellite weighting functions). Between about 35 and 45 km (channel 2), 
the Met Office version of the SSU data suggests that the models over- 
estimate the observed stratospheric cooling, whereas the NOAA SSU data 
suggest that the models underestimate it (Figs 1b and 2b). The most 
striking discrepancies are between about 25 and 35km (channel 1; 
Figs 1c and 2c). As demonstrated in refs 4 and 18, the Met Office SSU 
data are in reasonable agreement with the current generation of coupled 
CCMsat these altitudes. But as shown in Figs 1a—d and 2a—d, the cooling 
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Figure 2 | Trends in global-mean stratospheric temperatures between 1979 
and 2005. Trends in monthly mean, global-mean stratospheric temperatures 
are shown for the altitude ranges, data sets and model output indicated. 
Observed trends are denoted by the red, blue, green and purple vertical lines 
(observed trends are reproduced in the left and right panels). The normalized 
red, blue, green and purple probability distribution functions indicate the 
confidence ranges on the trend estimates, taking into account the effective 


in the new NOAA SSU channel | data is nearly twice as large as the 
cooling simulated by most of the CCMs. 

A similar story emerges when observations of global-mean stratos- 
pheric temperature are compared with the simulations of AOGCMs 
prepared for the upcoming IPCC Fifth Assessment Report (Figs le-h 
and 2e-h; results are from the Coupled Model Intercomparison Project 
Phase 5 simulations, CMIP5; see Table 1). Most of the CMIP5 models 
are not coupled CCMs and have considerably less vertical resolution at 
stratospheric altitudes than the models archived by CCMVal2. For this 
reason, relatively few CMIP5 model runs include altitudes sampled by 
SSU channels 2 and 3. 

The differences between the CMIP5 models and the observations are 
comparable to those noted in association with the CCMVal2 models in 
all SSU channels (Figs le-h and 2e-h). The CMIP5 models indicate 
considerably less cooling than both SSU products at about 40-50 km 
(channel 3); lie between the two SSU products at about 35-45 km (chan- 
nel 2); and provide a closer fit to the Met Office SSU data than the 
NOAA SSU data at about 25-35 km (channel 1). 

It is possible that the models are correct and that both SSU data sets 
are in error. But the CCMs and AOGCMs also exhibit smaller yet 
systematic discrepancies with observations in the lower stratosphere, 
which is sampled by the MSU channel 4 instrument (Figs 1d, h and 
2d, h). With few exceptions, the models underestimate the amplitude of 
the long-term cooling in the lower stratosphere (Figs 1d, h and 2d, h) 
and have difficulty simulating the amplitude of the response to the erup- 
tions of El Chichon and Mount Pinatubo there (Fig. 1d, h). Previous 
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number of degrees of freedom in the respective time series (for example, the 
95% confidence bounds correspond to the edges of the area that spans the 
middle 95% of the distribution function). a—h, Black bars show the histograms 
of the trends from the CCM runs available through the CCMVal2 archive 
(a-d) and from the AOGCM runs available through the CMIP5 archive 
(e-h). Each temperature trend bin is 0.01 K per decade wide. The total number 
of model runs is given in Table 1. 


studies have reported close agreement between trends in the MSU chan- 
nel 4 data and in CCMVal2 simulations"*, but those trend comparisons 
were done between observations of MSU channel 4 temperature and 
model output at specific height levels (that is, the model trends were 
shown as a function of height and not averaged over the MSU channel 4 
weighting function; see figure 2 in ref. 18, for example). 

The latitudinal profiles of the trends from the different SSU data 
sources are also remarkably different from those simulated by the cur- 
rent generation of CCMs (Fig. 3). The Met Office SSU data suggest that 
the cooling of the past few decades was relatively uniform with latitude 
(blue lines in Fig. 3a—c). In contrast, the NOAA SSU data suggest that the 
largest stratospheric cooling occurred at tropical latitudes, particularly 
between 25km and 45km (red lines in Fig. 3a-c). The differences 
between the Met Office and NOAA global-mean stratospheric temper- 
ature trends clearly derive primarily from tropical latitudes. The tropical 
stratospheric cooling indicated by the models is noticeably weaker than 
that indicated by the NOAA SSU data in the middle and upper stra- 
tosphere (Fig. 3a-c), and is generally weaker than that indicated by all 
MSU channel 4 products in the lower tropical stratosphere (Fig. 3d). 

What might cause cooling in the tropical stratosphere? The radiative 
effects of increasing carbon dioxide are modest below 40 km altitude’. 
Rather, at altitudes sampled by SSU channel 1, long-term tropical cooling 
is most likely to result from either anomalous rising motion, which 
decreases air temperature through expansion, or in situ ozone depletion, 
which decreases temperature by reducing the absorption of short-wave 
radiation. The two processes are closely related: rising motion leads to 
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Table 1 | Model runs used in this study 
CMIP5 model runs CCMVal2 model runs 


CanESM2* (5) AMTRAC3 
CCSM4 (6) CCSRNIES 
CSIRO-Mk3.6.0 (10) CMAM (3) 
FGOALS-s2 (3) EMAC 
GFDL-CM3* (5) LMDZrepro (3) 
GFDL-ESM2G (1) MRI (4) 
GFDL-ESM2M (1) NIWA SOCOL 
GISS-E2-H (15) SOCOL (3) 
GISS-E2-R (16) ULAQ 
HadCM3 (10) UMSLIMCAT 
HadGEM2-CC** (3) WACCM (4) 


HadGEM2-ES (4) 
NMCM4 (1) 
PSL-CM5A-LR (5) 
PSL-CM5A-MR (1) 
PSL-CM5B-LR (1) 
IROC4h* (3) 
IROCS (4) 
IROC-ESM*** (3) 
IROC-ESM-CHEM*** (1) 
PI-ESM-LR*** (3) 
PI-ESM-P*** (2) 
RI-CGCM3** (5) 
orESM1-M (3) 
orESM1-ME (1) 


umbers in parentheses indicate the number of ensemble members. The number of asterisks 
indicates at which level the model temperature data was used based on which levels are available in 
model output. No asterisks means used only in MSU channel 4 (any model with no output at 1 hPa). 
*Used in MSU channel 4 and SSU channel 1 (any model with output at 1 hPa). **Used in MSU channel 4, 
SSU channels 1 and 2 (any model with output at pressures below 1 hPa). ***Used in all channels (any 
model with output at pressures below 0.1 hPa). Model nomenclature is provided in ref. 17 and the IPCC 
Fifth Assessment Report (http://www.ipcc.ch/). 


decreases in ozone in the lower stratosphere through the vertical trans- 
port of low-ozone air from lower altitudes. The two processes are also 
both potentially implicated in recent stratospheric climate change. 

Rising motion in the tropical stratosphere occurs as part of the large- 
scale, equator-to-pole stratospheric mass circulation. Most coupled CCMs 
suggest that increasing greenhouse gases accelerate the stratospheric mass 
circulation'” “*. Such an acceleration is expected to be marked by decreases 
in tropical stratospheric ozone and temperatures (particularly in the lower 
stratosphere), and observations suggest that both changes are occurring. 
Ground-based and satellite measurements suggest that tropical lower- 
stratospheric ozone has decreased over the past few decades at a rate 
comparable to that predicted by the CCMs**”*, radiosonde data suggest 
that tropical lower-stratospheric temperatures have decreased since 
1979°?’° and the NOAA SSU data suggest that such tropical cooling 
extends to the middle and upper stratosphere (Fig. 3). (In principle, the 
acceleration of the mass circulation should also be marked by increases 
in temperatures and ozone concentrations in the extratropical stra- 
tosphere owing to the anomalous downward motion there, but the effects 
of the mass circulation at extratropical latitudes are opposed by the 
effects of polar stratospheric chemical ozone depletion in the Southern 
Hemisphere—the “Antarctic ozone hole’-—and masked by naturally high 
levels of year-to-year climate variability in the Northern Hemisphere.) 

If the new NOAA SSU data are correct, they suggest that the stra- 
tospheric mass circulation is accelerating at a rate considerably higher 
than that predicted by the CCMs, at least in the middle and upper stra- 
tosphere (that is, at the altitudes sampled by the SSU instrument). Again, 
itis possible that the models are correct and that the SSU data are in error. 
But the fact that the discrepancies between the magnitudes of the simu- 
lated and observed cooling in the tropical stratosphere extend to MSU 
channel 4, which samples the lower stratosphere and exhibits trends that 
are fairly reproducible from one data set to the next (Figs 1d, h, 2d, h and 
3d), suggest that model uncertainties should not be discounted. 


Moving forward to resolve the mystery 


Are the models missing a key aspect of stratospheric climate change? Or 
is there an error in the newly processed NOAA data? Which SSU data set 
is correct? Or are both in error? 
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Figure 3 | The north-south structure of zonal-mean stratospheric 
temperature trends between 1979 and 2005. a-d, Trends in monthly mean, 
zonal-mean stratospheric temperatures are shown for the altitude ranges, 
data sets and model output indicated. Error bars approximate the 95% 
confidence bounds. 


The latitudinal structure of stratospheric temperature trends is influ- 
enced by both radiative processes and variability in the stratospheric 
mass circulation. As noted above, the predicted acceleration of the stra- 
tospheric mass circulation” ** should lead to enhanced stratospheric 
cooling at tropical latitudes, and such cooling is found in the middle 
and upper stratosphere in both the NOAA SSU data and the CCMVal2 
simulations (Fig. 3). However, the magnitudes of the predicted accele- 
ration are not well constrained by theory, and the latitudinal structure of 
the trends exhibits considerable variability from data set to data set and— 
to a lesser extent—model to model”? (Fig. 3). Trends in the stratospheric 
mass circulation are difficult both to detect and to predict. 

In contrast, trends in global-mean stratospheric temperature are rela- 
tively simple to constrain quantitatively. The influence of the stra- 
tospheric mass circulation on temperature trends is negligible in the 
global-mean temperature because the regions of upward and downward 
motion average out. Thus trends in global-mean stratospheric tempera- 
tures are driven almost entirely by the radiative effects of changes in 
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stratospheric composition, primarily increases in carbon dioxide and 
changes in ozone concentrations, but also changes in the concentrations 
of water vapour, aerosols and other trace gases. If the NOAA SSU data 
are correct, then both the CCMVal2 and CMIP5 models are presumably 
missing key changes in stratospheric composition. 

What might give rise to the discrepancies between observed and 
simulated global-mean stratospheric temperatures highlighted here? 
The long-term increases in stratospheric concentrations of carbon 
dioxide are probably well constrained by observations and in models 
owing to the fact that carbon dioxide is largely inert and thus well mixed 
in the atmosphere. Simulations of stratospheric water vapour trends and 
their effects on temperature vary considerably from model to model’*. 
But the effects on temperature of stratospheric water vapour trends are 
much more important in the lower stratosphere than they are in the 
middle and upper stratosphere’. Therefore, uncertainties in simulated 
stratospheric water vapour trends may contribute to the discrepancies 
between simulated and observed temperature trends in the lower stra- 
tosphere, but they seem unlikely to contribute significantly to the dis- 
crepancies in the middle and upper stratosphere. Trace gases such as 
nitrous oxide, methane and fluorinated greenhouse gases are not 
believed to have had a pronounced effect on trends in the middle and 
upper stratosphere~’*. And multiple observational sources suggest that 
the overall trends in stratospheric aerosols were very small over the SSU 
period (about 1979-2005)°*'. Hence, the pronounced discrepancies 
between simulated and observed global-mean stratospheric temperature 
trends are most probably due to one of the following two possibilities. 
(1) The observations may be in error. The MSU channel 4 tempera- 
ture record is robust from one data set to the next, so we consider it to be 
unlikely that uncertainties in the MSU channel 4 data can account for 
the discrepancies between modelled and observed lower stratospheric 
temperatures shown here. Uncertainties in middle and upper strato- 
spheric temperatures derived from the SSU instrument are much larger. 
(2) The simulated ozone trends may be in error. The observed and 
simulated global-mean ozone trends are very similar in both the middle 
and upper stratosphere’®. We therefore consider it to be unlikely that the 
differences between modelled and observed temperature trends in the 
middle and upper stratosphere can be explained by differences in ozone 
trends at these altitudes. Uncertainties in ozone depletion in the lower 
stratosphere’ may help to account for the discrepancies between mo- 
delled and observed trends in temperatures there. 

How might the climate community resolve the mysteries raised by the 
new SSU data? First, the methodology used to generate the original Met 
Office SSU data remains undocumented and so the climate community 
are unable to explain the large discrepancies between the original Met 
Office and NOAA SSU products highlighted here. The World Climate 
Research Programme’s Stratospheric Temperature Trends Assessment 
Panel (of which several authors of this study are members) has encour- 
aged the scientists who generated the original Met Office data set to 
publish the methodology, but they are now retired. We encourage the 
Met Office to allocate resources towards the recovery and publication of 
as much of the original SSU metadata as possible. 

Second, the SSU data should be processed by at least one additional 
independent research group. Similar controversies regarding surface and 
tropospheric temperature changes over the past few decades have moti- 
vated tests of the reproducibility of trend estimates. Other key data sources 
are now routinely vetted, processed and published by a number of research 
organizations: scientists have produced at least three independent MSU 
temperature products, five independent radiosonde temperature pro- 
ducts’ and five global surface temperature products for climate research 
(see discussion in refs 4 and 7). The SSU data have been processed by only 
two independent research groups, and published by only one. 

Third, the amplitudes of the observed stratospheric ozone depletion 
should be critically assessed in all available data sources, discrepancies 
between simulated and observed variability in stratospheric ozone should 
continue to be explored, and remotely sensed observations used to estim- 
ate stratospheric ozone depletion should be processed by independent 
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research groups (for example, as done for MSU channel 4 temperatures). 
The World Meteorological Organization and International Ozone Com- 
mission are supporting an effort to critically evaluate ozone profile trends 
based on remotely sensed and in situ measurements. It remains to be seen 
whether revised estimates of stratospheric ozone depletion are large 
enough to account for the discrepancies between observed and modelled 
stratosphere temperature trends highlighted here. 

Finally, to avoid a continuation of the current perplexing and frustrat- 
ing situation, it is imperative that stratospheric altitudes are included in 
future climate reference data networks. The Global Climate Observing 
System (GCOS)—a project overseen by the World Meteorological 
Organization, the United Nations Environment Program, and other 
international bodies—is currently developing a ‘reference’ upper-air 
network consisting of around 30-40 ground-based stations that will be 
used to constrain the numerous atmospheric observations used in cli- 
mate research (the GCOS Reference Upper-Air Network; GRUAN*?). 
Other than these incipient GRUAN observations, there are currently 
no reference temperature data at stratospheric altitudes. The GRUAN 
effort is essential for assessing future stratospheric climate change with- 
out the ambiguities we currently face. 
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The global pattern of trace-element 
distributions in ocean floor basalts 
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The magmatic layers of the oceanic crust are created at constructive plate margins by partial melting of the mantle as it 
wells up. The chemistry of ocean floor basalts, the most accessible product of this magmatism, is studied for the insights it 
yields into the compositional heterogeneity of the mantle and its thermal structure. However, before eruption, parental 
magma compositions are modified at crustal pressures by a process that has usually been assumed to be fractional 
crystallization. Here we show that the global distributions of trace elements in ocean floor basalts describe a sys- 
tematic pattern that cannot be explained by simple fractional crystallization alone, but is due to cycling of magma 
through the global ensemble of magma chambers. Variability in both major and incompatible trace-element contents 
about the average global pattern is due to fluctuations in the magma fluxes into and out of the chambers, and their depth, 
as well as to differences in the composition of the parental magmas. 


Magmas parental to ocean floor basalts (OFBs) are traditionally con- 
sidered to evolve by fractional crystallization along ‘liquid lines of des- 
cent’, producing the oceanic crust with ~4—-5 km of gabbro underlying 
1-1.5km of basalt and dolerite’. The major-element compositions of 
OEB scatter around the low-pressure olivine+ plagioclase+ clinopyro- 
xene (ol+pl+cpx) cotectic (the hypersurface in compositional space 
where multiple solid phases will crystallize at the same time from a 
single liquid), consistent with their evolution by fractional crystalliza- 
tion of these phases (Fig. 1). The scatter is attributed to (1) composi- 
tional variation in the source, (2) differences in extent of melting 
reflecting variations in mantle potential temperature**, and (3) the 
proportion of the phases on the cotectic, which depends on both pres- 
sure of crystallization and magma composition*’. 

However, the abundances of highly incompatible trace elements 
increase with indicators of evolution like MgO content ([MgO]) more 
rapidly than can be explained by fractional crystallization*’. Although 
these early studies considered only local trends, the same pheno- 
menon appears when the chemistry of OFB is viewed from a global 
perspective. Consider three rare earth elements (REEs), representative 
of incompatible trace elements in general, whose contents from two 
global databases of OFB’®"' are plotted against [MgO] in Fig. 1d-f. 
Above ~5.5 wt% MgO, the incompatible trace elements are distri- 
buted approximately log-normally about a line of log[M] versus 
[MgO], where [M] is the content of M in the melt. The distribution 
of each M is characterized by three entities: first, the intercept, repre- 
senting an average parental content, [M],, taken here at [MgO] 
= 10 wt%; second, the slope by which the log-mean content changes 
with [MgO], that is, dlog[M])/d{[MgO]; and third, variability about 
the slope. To test whether these three entities remain approximately 
constant with [MgO], we plot the intercepts, slopes and variabilities 
for Na, P, K and Ti in bins of 1 wt% [MgO] for 9,050 samples previ- 
ously analysed by electron microprobe” (Fig. 2a-c). The large num- 
ber of samples is mandatory for obtaining precise values over such 
short segments. All three entities remain nearly constant from 5.5 to 
8.5 wt% MgO, which covers 85% of the glasses. 

Plots of intercepts, slopes and variabilities of all the REEs versus 
ionic radii reveal systematic patterns for all three entities (Fig. 2d-f). 


Crystal/melt partition coefficients for REEs correlate well with ionic 
radii'’, with approximately a parabolic relationship, indicating that 
crystal/melt partitioning controls both the intercepts and slopes. Also 
apparent is the systematic nature of the variability, confirming previ- 
ous observations'*”’, 

The linearity of log([M)] with [MgO] might appear consistent with 
fractional crystallization, because [M] varies with the degree of cry- 
stallization F and the bulk solids/melt partition coefficient Dy, ac- 
cording to the Rayleigh equation: 


log [M]/[M], = (Dm — l)log F (1) 


and, as shown in Fig. Ic, the relationship between F and MgO is well 
approximated along the ol+ pl+cpx cotectics by: 


log F= —1.245 + 0.132[|MgO] (2) 


However, fractional crystallization predicts that the slopes d(log[M])/ 
d[MgO]) should approach a theoretical limit of —0.132 asymptotically 
as Dy goes to 0. For the REEs, for example, the slopes expected for 
simple fractional crystallization are shown in Fig. 2e. The actual slopes 
of the light REEs exceed the theoretical limit (La by nearly a factor of 
two), and the plot of slope versus REE ionic radius (Fig. 2e) shows no 
sign of the expected asymptotic behaviour. The same phenomenon is 
found for a suite of highly enriched basalts from Macquarie Island’’. 

Trends for the other 12 trace refractory lithophile elements (RLEs) 
can be compared to the REEs by plotting intercepts versus slopes 
(Fig. 3a). The other trace RLEs with incompatibilities within the range 
covered by the REEs fall on the same parabola, with the exception of 
Sr, which plots to the right of the REEs. Sc, the least incompatible of 
the trace RLEs, falls off the extension of the REE array, probably be- 
cause a significant proportion is retained in the mantle residue during 
melting. The most incompatible RLEs (Ba, Nb, Ta and Th) define a 
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Figure 1 | Global trends in ocean floor basalt (OFB) glasses. a-c, Major- 
element trends in the crystallization of OFBs, illustrated using the global data 
sets of refs 10 and 11. Major-element oxides are plotted versus MgO, the oxide 
that is most sensitive to the degree of fractional crystallization; a, FeO,,, and 
Na,O; b, Al,O; and CaO. The average global trends (solid curves) were 
modelled using PETROLOG software” and models for ol, pl and cpx**. The 
representative parental composition at 10 wt% MgO is taken (in wt%) as: SiOz 
49, TiO, 1, AlyO3 17.5, FeOjo¢ 8, MgO 10, CaO 12, Na,O 2, MnO, KO and HO 
0.1, and P,Os 0.05. Pressure is 0.3 GPa and relative oxygen fugacity is set by the 
quartz—fayalite-magnetite oxygen buffer. Some of the mismatch for FeO,., may 
reflect problems with traditional estimates of MORB Fe*‘/Fe*" (ref. 44). The 
average mass fractions of phases crystallizing along the ol+cpx+ pl cotectic are: 
f° = 0.06, fP* = 0.44, fP! = 0.49. ¢, The fraction crystallized (F) is shown as a 
function of MgO and parameterized using an exponential relationship (solid 


minimum limiting slope, which we propose corresponds to Dy, ~ 0. 
Intercepts for the cosmochemically siderophile and volatile elements 
(SVEs) cannot be compared with the RLEs without introducing cir- 
cular arguments, because their normalizations would assume a rela- 
tionship to the RLEs’’. The exception is Pb, whose normalization to 
the RLEs is constrained by U-Th-Pb isotopic systematics'’. Because 
neither the slopes nor the variabilities depend on normalization, these 
entities for the SVEs are included in Fig. 3b. Several SVEs share the 
minimum limiting slope: Rb, Cs and W, and also Cl and Br from the 
Macquarie Island suite'*. There is a striking negative correlation 
(R = —0.95) between intercepts and variabilities. 


The model 


Although basalts comprise only a minor part of the magmatic output 
that forms the oceanic crust (most of the parental magma crystallizes 
as gabbro), it has been commonplace to assume that they preserve 
relative abundances of the incompatible trace elements, because sim- 
ple fractional crystallization should not change incompatible-element 
ratios at plausible degrees of crystallization (see equation (1); see also 
Fig. 2e). But it has been emphasized’*”° that repeated replenishment 
of crustal magma chambers followed by crystallization negates this 
assumption. The replenished magma inherits part of its composition 


curve). d-f, Variation in three REEs (d, La; e, Nd; f, Yb) with MgO using two 
global databases of OFBs'*"' and highly enriched basalts from a single locality, 
Macquarie Island'°. Elements are scaled by normalizing to CI chondrites. The 
lines (between 5.5 and 10 wt% MgO) are least-squares fits to the equation: 
log(M]/[M] ci = log{M]o/[M]ci + d(logiM])/d[MgO] x ([MgO] — 10), 
assuming equal weighting of log[M] and treating [MgO] as an independent 
variable. Each trace-element distribution is characterized by three attributes: 
(1) the mean of the log-normal distribution taken at 10 wt% MgO, called 
‘intercept’; (2) the slope of the mean of the log-normal distribution with MgO, 
d(log[M])/d{MgO], taken as constant; and (3) the variability, defined as: 


2 1/2 
o(logimy) = (2 ie log{M] a) 


concentration at that MgO content, calculated from the intercept and slope. 


, where [M] alc is the average 


from previous cycles, causing incompatible elements to build up rela- 
tive to compatible ones, which are preferentially removed by crystal- 
lization. If the replenishing magma has constant composition, and the 
fractions of the magma chamber tapped, dy, and crystallized, dx, in 
each cycle are constant, magma chambers of constant mass reach a 
steady state after a sufficient number of cycles, in which, for each 
element M, the flux in is balanced by the flux out: 


[M] (Pr + $x) = [M] rr + [M] xx (3) 


where [M], is the content of M in the replenishing (or parental) 
magma, [M]¥ is the content of M in the magma tapped to make basalts 
and dolerites, and [M]x is the content of M in the crystals. The idea of 
magma chambers undergoing cycles of replenishment and tapping 
is an attractive proposition for mid-ocean ridges. These structures sit 
athwart rifting zones and continually receive magma from decom- 
pression melting; they distribute the magma, with fraction ¢+ to basalt 
and dolerite, and ¢x to cumulate gabbros, to form the crust on top of 
the diverging plates. 

Individual magma chambers are unlikely to reach steady state 
because of changes in the composition of replenishing magmas, or 
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Figure 2 | Homogeneous relations among minor elements (Na, Ti, K and P) 
and rare earth elements (REEs) in OFB glasses. a-c, Intercepts (a), slopes 
(b) and variabilities (c) of Na, P, K and Ti from the Smithsonian abyssal 
volcanic glass data file’. The number of samples (1) in each overlapping 1 wt% 
MgO bin is given in a. P and K are near the limits of detection (especially at low 
contents in high-MgO samples), hence analytical uncertainty and pixelation of 
data** compromise the reliability of these data. Each characteristic is nearly 
constant in each MgO bin, with the exception of the two highest MgO bins, 
where steady state is probably not reached. The horizontal lines are the global 
averages. d-f, Intercepts (d), slopes (e) and variabilities (f) of REEs against ionic 
radius, using two global databases of OFBs'*"' and samples from Macquarie 
Island'’. The intercepts and slopes fall on parabolas, indicating control by 


in their size or in ¢y or $x, but one can envisage averages over many 
cycles and many chambers. Such averaging allows the global ensemble 
of magma chambers to be described by the average #x and 7, reple- 
nished by magma with average contents [M],, and fractionated with 
average bulk crystal/melt partition coefficients Dy. The global ave- 
rage content of M tapped as basalt or dolerite is [M],.. The phases cry- 
stallizing to form the cumulate are ol, pl and cpx. Bulk partition 
coefficients, Dy, are given by: 


Dy =Def?! Oa 7 (4) 


where De is the Plagioclase/melt partition coefficient (similarly for 
D\i* and De), and f?! is the mass fraction of plagioclase in the cry- 
stallizing mineral assemblage (similarly for f* and f°’). The mass 
fractions are related by f?! + f°* + f°! = 1, and similarly for their glo- 
bal averages (f°'and so on). Accessory phases with f< 0.01 need to be 
included for a few elements: spinel for Cr, and sulphide for the chalco- 
phile elements (S, Se, Cu, Ag and probably Ni and Pb). 

Elements known to be highly incompatible (My), but which other- 
wise have different chemical properties (Rb, Cs, Ba, Th, Nb, Ta, W, Cl 
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crystal/melt partition coefficients. The slopes calculated for simple fractional 
crystallization using the phase proportions on the cotectic for the model in 
Fig. 1, and crystal/melt partition coefficients from Supplementary Table 1, are 
shown in e. The slopes for the light REEs are greater than simple fractional 
crystallization can produce. Eu occurs partly in 2+ as well as 3+ oxidation 
state, making it more compatible in plagioclase during low-pressure evolution 
(higher slope) compared to the other REEs. The higher intercept reflects a 
positive Eu anomaly in the mantle source’®. The extra scatter in the data set of 
ref. 11 is due to the monoisotopic REEs (Pr, Tb, Ho and Tm) not being reported 
in some subsets. The curved lines are empirical fits to quadratic functions of the 
ionic radii. 


and Br) share the same limiting minimum slopes d(log{[My])/ 
d{[MgO] of —0.26 (Fig. 3). This observation provides the key to deci- 
phering the global patterns, because we can assign to these elements 
Dm, = 0, hence [My] is zero, and, regardless of whether crystalliza- 
tion is fractional or equilibrium, equation (3) reduces to: 


iy 


T 


(5) 


The linear relationship between log[My] and [MgO] can be written: 


[Maly _ 
[Mu], 


[Mu])/d[MgO] x ({MgO], — 


=o + *) 


It is then necessary to describe how [MgO] varies with ¢y and ¢. 
It has been noted*' that if the replenishing magma is mixed with 
inherited magma, the chamber is then tapped, and subsequently the 


[MgO],) 


(6) 


©2012 Macmillan Publishers Limited. All rights reserved 


te 
1.0+ 
x 
= 
So 08 
u 
8 
= 0.6+ 
& 
rei 
Q 0.4- 
a 
iS 05 
0.2 Th t ) Ba : === Variability = 0.49 — 0.40 x intercept 
: ® R=-0.95 
0.4 
05,1} @ os} : 
e 8 
Rb]|Ba 8 0.2 
° 
04+ ew 0.1 
Sc @ 
7 ONT aa 1 : 
% 02 O04 06 O08 10 1.2 
s O03 K Intercept at [MgO] = 10 wt% 
5 
& 
w 
= 0.2b 
Sr 
Minimum rei bd 
0.1 + limiting slope P 2 Bi 
Hr a 
Ti Y °Poo Yr e 
Na ge ie) 
0.0 : 
-0.3 0.2 =0.1 0.0 
Slope, d(log[M])/d[MgO] 
Figure 3 | Systematic relations between intercepts at 10% MgO, slopes and 


variabilities in OFB glasses. a, Intercepts versus slopes for REEs, other RLEs 
and Pb. b, Variabilities versus slopes for REEs, other RLEs and some SVEs. Note 
the minimum limiting slope of —0.26 reached by the most incompatible 
elements (Nb, Ta, Ba, Rb, Cs, W). Sr and Eu plot shifted to the right relative to 
REEs of similar incompatibility during mantle melting (Sr is similar to Nd, Eu 
falls between Sm and Gd) because of their compatibility in plagioclase; the same 
may explain the Pb anomaly. Sc has a lower intercept at 10 wt% MgO because of 
its retention in residual mantle after melt extraction due to its compatibility in 
orthopyroxene and olivine. Solid curves are least-squares fits to REEs only, with 
their extrapolations to more compatible behaviour shown as the horizontal 
dashed lines. ¢, Variabilities versus intercepts, with best-fit line showing the 
strong negative correlation (R = —0.95) between these entities. 


remaining magma evolves by fractional crystallization before the next 
cycle of replenishment, [M]x is related to [M]y through the usual 
equation for fractional crystallization (equation (1)), which, on sub- 
stitution into equation (3), gives [M]+ at steady state: 
[M]; (x +r) 
[M], 1+¢~—(1—¢ )™ 


This equation differs from that in refs 19 and 20, which assumed 
fractional crystallization would follow replenishment before tapping. 
Applying equation (7) to MgO gives: 

[MgO], (¢x +¢- 
[MgO], = 8 ox + or) 


1+ $,>—(1—ox)™ 


(7) 


(8) 


Equations (6) and (8) have two unknowns, ox and brs that are 
functions of the global average MgO content of the tapped magma at 
steady state, [MgO],. Real solutions have 0 < ¢x, ¢p < 1 and (x + 
¢y) <1; solutions outside these limits imply steady state cannot be 
reached under the assumed constraints. The equations can be solved 
uniquely if [MgO], and Dyg are specified. 


ARTICLE 


The average MgO content of parental OFB (that is, [MgO],) has 
remained controversial for decades, because the MgO content reflects 
the mantle potential temperature’. We circumvent the controversy 
by noting that [MgO], can be interpreted as the composition near the 
start of low-pressure magma-chamber evolution, and not necessarily 
as the ‘primary’ composition extracted from the mantle. Figure 4a shows 
the histogram [MgO] for the 9,050 OFB glasses in the Smithsonian data 
file’’. Only one glass has more than 10 wt% MgO (10.07 wt%). We there- 
fore take [MgO], to be 10.0 wt% or thereabouts. This is approximately 
the minimum required by several phase equilibrium constraints—in 
particular, the contraction of the primary phase volume of olivine with 
increasing pressure”’. The value of Dyigo along the ol+pl+cpx cotectic 
as modelled in Fig. 1 increases from 1.3 at 8.3 wt% MgO to 1.8 at 5.5 wt% 
MgO, but the replenishing magma would move liquid compositions off 
the cotectic into the ol+ pl primary phase volumes, giving initially higher 
Dwg. We make the simplifying assumption that Dug is constant, with a 
value determined by the fractions of phases crystallizing. 

Solutions satisfying all constraints (allowing for uncertainties in the 
literature values of Dy.) are obtained with constant Dogo at values 
near 2, which we optimized to 1.9, while increasing [MgO], to 10.4 wt% 
so that steady-state behaviour begins at [MgO], = 8.6 wt%, which is 
similar to the parental composition given in ref. 4. Steady-state beha- 
viour terminates at [MgO]. = 5.6 wt% (with ¢.; = 0; Fig. 4b), consis- 
tent with the distribution of [MgO] in OFB glasses (Fig. 4a). The 
implication of our model is that liquids with [MgO] >8.6 wt% are rare 
because these compositions either bypass magma chambers or come 
from juvenile magma chambers that have yet to reach steady state, pos- 
sibly contributing to the higher variability of P30; and K,O in the most 
Mg-rich basalts (Fig. 2c). Liquids erupted with [MgO] <5.6 wt% come 
from dying chambers that are not being recharged. 

Calculated log([M]+/[M],) for various values of Dy are shown as a 
function of [MgO] in Fig. 4c. The trends are linear, as required empiri- 
cally (Fig. 1d-f). We use 19 trace elements selected because they have 
relatively well constrained partition coefficients (ten REEs; four other 
RLEs, namely Sc, Ti, Sr and Zr; plus the five SVEs, namely Li, Na, Mn, 
Co and Ni) to calculate the best match between Dy, obtained from 
slopes and the crystal/melt partition coefficients from the literature; 
we do this by least-squares optimization of f?', f* and _f!, with the 
constraints f?! + fP* + f°! =1 and Dyg = 1.9. We find f°? =0.18, 
fP! = 0.41 and f‘P* = 0.42 (details in Methods). With an uncertainty 
of 20% in crystal/melt partition coefficients, the mean square 
weighted deviation (MSWD) for the fit is 1.4, indicating good agree- 
ment between model and data (Fig. 5). For the enriched Macquarie 
Island basalts, slopes are higher for the cpx-hosted trace elements such 
as the heavy REE (Fig. 2e), but lower for plagioclase-hosted Sr, Eu and 
Na, signifying that they have evolved by crystallization with much 
higher cpx/pl. The least-squares optimization of the model for these 
basalts gives f°! = 0.06, f?! = 0.09, fP* = 0.85 and Dog = 2.2, assu- 
ming [MgO], = 10.4 wt%. 

The global solution infers a relationship between ¢y and ¢, in the 
ensemble of OFB magma chambers (Fig. 4b). These two variables 
control the average MgO content of the magma chamber (that is, 
[MgO]..), which varies nearly linearly with the average fraction of 
magma retained after each cycle (that is, 1 — dy — ¢,). Either there 
is some process by which #y determines #; on average, or vice versa. 
Whether this is plausible is a matter for geophysical fluid dynamics 
and is raised here as a hypothesis. It is easier to visualize physical 
reasons why magma chambers might erupt a constant fraction of their 
contents (that is, #; constant), but such models do not produce linear 
trends of log[M] versus [MgO] under our simplifying assumption of 
constant partition coefficients. However, an elaborated model—with 
values of Dy and Dug allowed to vary with [MgO] and decreasing 
temperature, or with related indices of evolution (such as the 
anorthite content of plagioclase), as indicated by experimental parti- 
tioning studies (see, for example, refs 24-26)—may provide satisfac- 
tory solutions. The model does not depend on the sizes of the magma 
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Figure 4 | The global ensemble of replenished and tapped magma chambers. 
a, Histogram of [MgO] in OFB glasses from the Smithsonian abyssal volcanic 
glass data file'’ (shaded bars), with the subset of samples analysed for trace 
elements in ref. 2 shown by the white bars. Also shown (right-hand y axis, dot- 
dashed line) are the densities of liquids for the model given in Fig. 1, calculated 
with PETROLOG*’. The frequency distribution of [MgO] does not 
correspond to the density minimum. b, Solution to the steady-state magma 
chamber model with constant Dyyg = 1.9 and [MgO], = 10.4 wt%. ¢y is the 
average fraction of the average magma chamber tapped per cycle, and #y is the 
average fraction crystallized. Steady state is achieved only within the limits 
0<(y + $x) < 1. The region of steady state coincides with the distribution of 
[MgO], which explains why OFB with >8.6 wt% MgO or <5.6 wt% MgO are 


chambers, which may be too small in places to be imaged by geo- 
physical means. The ratio ¢;/¢y changes from 0.5 at 8.6 wt% MgO 
to 0.06 at 5.6wt% MgO, leading to a prediction that the ratio of 
basalt + dolerite to cumulus gabbro in oceanic crust might vary simi- 
larly, correlating with the MgO content of the basalts. 
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Figure 5 | Testing the model. Shown is a comparison of bulk partition 
coefficients Deere calculated from the observed slopes d(log[M])/d[MgO] 
assuming the model values of #; and $x (see Fig. 4b), with bulk partition 
coefficients calculated from experimentally or empirically determined mineral/ 
melt partition coefficients from the literature, at the model’s best-fit values of 
f?* = 0.42, with f?! = 0.41 and f°” = 0.18. Error bars are +2s.d. Good 
agreement is obtained for all elements including the compatible element Ni, 
with a reduced y? (MSWD) of 1.4. See Methods and Supplementary Table 1 for 
the partition coefficients used in the modelling. 
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[MgO], = 10.4 wt%. Slopes d(log[M])/d[MgO] are calculated to be constant, 
except for the hypothetical case of a very highly compatible element. Inset, 
dependence of Dy, on the slopes. A distinctive feature of the model is that slopes 
are most sensitive to Dy for elements with 0 < Dy, < 0.1 (that is, highly 
incompatible), but with the slopes nevertheless varying nearly linearly with Dy 
in this region until a cut-off is reached at the minimum limiting slope. 

d, Decrease of compatible element [Ni] with [MgO] in OFB glasses'* compared 
to that calculated using our model, with [Ni], = 220 p.p.m., Dog = 1.9 and 
[MgO], = 10.4. wt%. A parental liquid with [Ni], = 220 would be in 
equilibrium with mantle olivine with 3,300 p.p.m. Ni if py melt 15 (see 
Supplementary Information), precluding significant olivine-only fractionation. 


Incompatible elements are fractionated from each other in propor- 
tion to the absolute difference in their bulk partition coefficients (that 
is, as Di —Dyz). Such fractionation is much greater than can be 
achieved by simple fractional crystallization, and can change the 
ratios of all but the most highly incompatible elements. Thus the 
model can explain the observed differences between slopes of K, U 
and La, which have Dy ~ 0.01-0.05, and the difference between these 
slopes and those of the even more incompatible elements (Ba, Th, Nb, 
and so on). Only for these latter elements, which share the limiting 
minimum slope, does the model predict that their ratios should 
remain constant. Nb/Th and Rb/Ba would be examples. 

The influence of plagioclase can be used to distinguish between 
crustal and mantle processes, because plagioclase is not stable in the 
pressure interval over which partial melting in the mantle takes place, 
but is a major phase in low-pressure crystallization (see, for example, 
refs 5, 6, 27). For the majority of incompatible trace elements, clino- 
pyroxene is the main host phase during both mantle melting and 
crustal evolution, which accounts for the good correlation between 
intercepts and slopes (Fig. 3a). Sc is one exception, because its compa- 
tibility in orthopyroxene allows a significant proportion to be retained 
in the mantle residue of partial melting**. Anomalous behaviour is 
also shown by Sr and Eu. Because of their compatibility in plagioclase 
(for example, pe melt 1.6; ref. 28), Sr and Eu plot shifted towards the 
right in Fig. 3a relative to their normal geochemical compatibility 
among the REEs (Sr is considered to be similar in incompatibility 
to Nd during mantle melting, and Eu should fall between Sm and 
Gd). So does Pb, which is also more compatible in plagioclase than 
clinopyroxene (DP/ mel ne 0.1; ref. 29), although fractionation of Pb by 
immiscible sulphide matte (responsible for the compatibility of Cu 
and Ag in OFB evolution’) may additionally contribute. The slope of 
Pb is similar to that of Ce during low-pressure evolution, but its 
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depletion in the mantle source, indicated by its normalized intercept 
value, is closer to that of U, providing a reconciliation of the ‘third Pb 
paradox”™’. The variability of Sr stands out as larger than that of other 
elements with similar slopes (Fig. 3b), but is consistent with its incom- 
patibility (similar to Nd) during mantle processes. 

Another element much more compatible in plagioclase than clino- 
pyroxene, especially at low pressures, is Na. As the Macquarie Island 
basalts show, variations in Na contents in OFB at a given [MgO] may 
reflect the variations in the ratio of pl/cpx that have fractionated. 


Variability of trace elements in OFB 


Much of the interest in OFB geochemistry comes from the expecta- 
tion that part of the variations in their composition may be due to 
plate-tectonic variables, including the thermal structure of the mantle. 
For example, variations in Na, Fe and Ca/Al correlate with plate 
spreading rate and perhaps axial depth of mid-ocean ridges***?"**. 
A key question is what proportion of the observed variations arises 
from low-pressure evolution, and what proportion is due to mantle 
processes, or source composition. The entity that we call variability 
addresses this problem. Note that our definition of variability (see 
Fig. 1 and Supplementary Information) differs from that used previ- 
ously'*”*, because we calculate it relative to the global trends, assu- 
ming probability density functions of log-normal distributions with 
the mean depending linearly on [MgO]. 

Although the very highly incompatible elements (Ba, Nb, Th and 
so on) are not fractionated from each other by the magma-chamber 
processes, they show the highest variabilities (Fig. 3), which are there- 
fore ascribed to variability of the parental magmas (that is, in [M].,). 
Potentially this may derive from variations in extent of melting, the 
mechanism of melt extraction (for example, residual porosity), or be 
inherited from the mantle source. The distributions of radiogenic 
isotopic ratios in OFBs allow us to monitor the heterogeneity of source 
compositions. There is an imperfect correlation (R ~ —0.5) globally 
between La/Sm and '**Nd/'“*Nd, which reflects time-averaged Sm/Nd 
(see figure 14 in ref. 35). However, the variations of '8Nd/'“4Nd in 
modern basalts are a function of both the extents of the fractionation of 
parent Sm from daughter Nd in the source, and the times at which the 
fractionations happened. Subsequent fractionation of La/Sm (and 
hence by implication, Sm/Nd) either by melting processes, or, as shown 
to be possible here, during low-pressure differentiation, will further 
blur the correlation. Although source effects are quite important, iso- 
tope/trace-element correlations cannot quantify them very well at this 
point. 

Variability will accrue from the many ways that the magma cham- 
bers depart from both steady state and the global average. The basic 
principle is that a magma will be more enriched than the average at the 
same [MgO] by inheriting a higher proportion of enriched magma 
from previous cycles, and vice versa. Magma may also be erupted out 
of sequence—for example, from a chamber undergoing fractional 
crystallization between replenishments. Basalts erupted in one locality 
over a few years, such as those from well-studied segments of the East 
Pacific Rise***’, probably record such departures. Compositions from 
these suites plot along short chords of log|M] versus [MgO] arranged 
‘en échelon’ at low angles across the global trends, as expected for 
simple fractional crystallization. We envisage the global trends as 
the averages of many such chords. 

The number of cycles needed to reach steady state increases with 
the incompatibility of the element*’. Although this may account for 
some of the increased variability of the most incompatible elements, 
mixing in the magma chambers acts to homogenize variability in 
parental magmas. Melt inclusions trapped in growing crystals may 
preserve incompletely homogenized aliquots of replenishing magma, 
which explains why they often record greater variability than bulk 
magmas”. Detailed interpretations of the chemical variability in short 
ridge segments has highlighted the complexity in individual magma 
chambers undergoing replenishment**”’. But when taken over the 
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global ensemble of magma chambers, this complexity averages out 
to give the systematic global trends shown in Figs 2 and 3. 


Implications for OFB petrogenesis 


The ‘cotectic illusion’ (Fig. 1) is caused by crystallization in reple- 
nished magma chambers forcing liquids onto the cotectic, on which 
magma compositions appear “perched”’”*°. Erupted major-element 
compositions are indistinguishable from those produced by fractional 
crystallization along the cotectic, but their trace element systematics 
differ greatly. A series of compositions plotting along a cotectic may 
not be a ‘liquid line of descent’ related by simple fractional crystal- 
lization, but the locus of different fractionation paths, each ending on 
the cotectic. The assumption that such liquid compositions reflect the 
depth of the magma chamber in which they evolved remains valid*°, 
but the many inferences drawn from OFB chemistry that have as- 
sumed the paradigm of simple fractional crystallization in order to see 
beyond their low-pressure evolution require re-evaluation. 

Simple fractional crystallization is inefficient at changing the ratios 
of incompatible elements. There has therefore been a presumption 
that ratios of even quite moderately incompatible trace elements like 
the mid to heavy REEs in OFB are inherited from their parental com- 
positions. On the contrary, replenished magma chambers are effective 
at changing incompatible-element ratios. Only ratios among elements 
with Dy < 0.01 (for example, Ba, Nb, Ta and Th and the SVEs Rb, Cs 
and W) are expected to escape change during low-pressure evolution. 

Uranium-series disequilibria provide an example of how the reple- 
nished magma chamber model necessitates re-evaluation of petro- 
genetic interpretations. Uranium-series disequilibria measure Th/U 
fractionations over short timescales (<10* years), so cannot reflect 
source heterogeneity, but have been ascribed to in-growth during 
partial melting”. Fractional crystallization has been assumed to be 
inconsequential. However, using our solution to the average global 
pattern, the fraction of a perfectly incompatible element (D,, = 0) 
inherited from previous cycles of the global-average magma chamber 
is 81% at 8 wt% MgO and 93% at 6 wt% MgO, raising the possibility 
that U-series secular disequilibria may reflect in-growth in the magma 
chamber. Among the global ensemble of magma chambers there will 
be those whose characteristics will produce even more extreme effects. 


METHODS SUMMARY 


Trace-element contents of OFB glasses were fitted to the equation: 
log[M] = log[M], + slope({MgO] — 10) 


by minimizing 7’: 


= ogi? (eel log[M], —slope(10—[MgO])” 


assuming that o(log[M]) is constant. If y7/n is put equal to 1, its most probable 
value, then: 


n 


2. 1/2 
o(log(M]) (= (ogi) ~ log/M] 1.) ) 


where n is the number of data and log[M] caic = log{M]« 
We take o(log[M]) as the definition of variability. 

To facilitate comparison, intercept values for RLEs at 10 wt% MgO are plotted 
normalized to CI chondrites using data in ref. 17. The intercept value of Pb was 
likewise normalized by dividing by the bulk silicate Earth value of 0.185 p.p.m. 
obtained from ***U/?™Pb = 8.5, and multiplying by the RLE enrichment factor of 
2.8 (ref. 17). 

The evaluation of the steady-state magma-chamber model proceeds by first 
calculating, as a function of the average tapped MgO content, [MgO], the para- 
meters ¢ and $x, the global averages of the fractions tapped and crystallized, 
where the averages are to be thought of as taken over the global ensemble of 


slope(10 — [MgO]). 
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magma chambers. This initial stage of the calculation uses the observation that 
the slopes of the logarithms of the concentrations versus MgO for all the very 
highly incompatible elements (that is, d(log[My])/d[MgO]) share a constant 
value of —0.26 (Fig. 3). These calculated parameters were used to calculate the 
slopes for bulk partition coefficients greater than zero (see Fig. 4c). These relation- 
ships were then used to convert the observed slopes for 19 elements to bulk 
partition coefficients, D¥??*. Finally, values of D*?* were matched to bulk par- 
tition coefficients calculated from crystal/melt partition coefficients, obtained 
from experiment or natural phenocryst/matrix pairs, as shown in Fig. 5. The 
model has three parameters, the global average fractions of plagioclase, clinopy- 
roxene and olivine crystallizing (f"", f* and f"', respectively), only one of which 
is independently variable. A fuller description is given in Methods. 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 
The global average values of ¢, and $x in the steady-state magma-chamber 
model for the global average concentrations of the analysed trace elements, M, 
both incompatible and compatible, are obtained from equations (6) and (8) as a 
function of the MgO content of the tapped magma, using the observation that the 
minimum limiting slope, d(log[My])/d[ MgO], which is observed to be —0.26 
(Fig. 3), corresponds to completely incompatible elements, My, with presumed 
Dm, = 0. The calculation proceeds by using equation (6) to define ¢, in terms of 
oy: 

ox = bor (9) 
where: 


b= 10'~a(ogiMu))/d[Mgo] x ((MgO], —[MgO],} _ i 


(10) 
Substitution of this relationship into equation (8) then gives: 


MgO}; _ Ar(1+6) 
[MgO] c 


oO 


(11) 
where: 


c=14+¢,—(1— boy) (12) 

The model requires that the average MgO content of parental OFB, [MgO],, 
and the average bulk partition coefficient for Mg, Dug, are both known. The solu- 
tion also determines the range of [MgO]., over which the physically real values of 
(@y + $x) occur: the lower limit of [MgO], for steady state is at (?p + px) =0 
while the upper limit is at (f-, + 6x) = 1. We therefore performed a grid search of 
[MgO], and Dmg bivariate space around petrologically reasonable initial esti- 
mates of 10.0 wt% and 2, respectively, which established that the solution with 
[M], = 10.4 wt% and Dyg = 1.9 reached steady state between 5.6 and 8.6 wt% 
MgO, in good agreement with values implied by the histogram of MgO frequen- 
cies in OFB (Fig. 4a). The minimum limiting slope, d(log[My,])/d[MgO], was 
taken as —0.26. 

The values of o; and ¢ thus found as a function of [MgO], which are plotted 
in Fig. 4b, can then be substituted into equation (7) to calculate numerically [M]- 
as a function of [MgO], at steady state for values of Dy other than zero: 


[M] 
[M], 


Gs +$r) 7 
1—¢,—(—¢x)™ 


(13) 


The relationship between log({M],/[M],) and [MgO]. for values of Dy 

between 0 and 1,000 is shown in Fig. 4c. This relationship is linear except at very 
high Dy (>10), which is not relevant to any of the trace elements controlled by 
the major crystallizing phases (plagioclase, clinopyroxene and olivine), but could 
perhaps describe the behaviour of Cr (highly compatible in spinel) or Ir, Os and 
Ru (in alloys or sulphide). 
Inverse modelling and data fitting. The calculation can be inverted to deduce the 
value of the bulk partition coefficient that matches the observed slope d(log[M])/ 
d[MgO], called here D¥°"*. The calculation was done numerically by setting up an 
Excel spreadsheet to calculate values of log([M].,/[M],) as a function of an arbit- 
rary input of Dy at intervals of [MgO], of 0.1 wt% from 5.7 to 8.5 wt%, then 
computing the average linear slope d(log({M].-/[M],))/d[[MgO}.,]. Values of Dyy 
were then changed to achieve a match between calculated and observed slopes. 
Uncertainties o(DyeP*) are calculated in the same way from the observed uncer- 
tainties in the slopes. 

The test of the model is how well the calculated values of D¥??* match the bulk 
partition coefficients, Dy, which are the sum of the crystal/melt partition coeffi- 
cients: 


1/melt F ItZ 1/melt F 
Du = pen ‘pl + pepx/me cpx + pe/m ‘ol 


(14) 


with the constraint: 


espe sp a1 (15) 

The crystal/melt partition coefficients are obtained from either experiment or 
phenocryst/groundmass analyses of basalts (see Supplementary Table 1). Because 
the model is based on Dyygo = 1.9, for internal consistency we imposed the second 
constraint: 
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(16) 


__ ppl/melt Fpl cpx/melt Fepx ol/melt Fol 
1.9=Dyeo f? +Dyigo P* + Digo 


with the values of DB", DEX" and DeV/?"" fixed at those calculated along the 
cotectic (Fig. 1; Supplementary Table 1). With two constraints, there is only a 
single parameter to be refined, say f?*. 

The optimum solution is obtained by minimizing the conventional least- 
squares cost function, v: 


(17) 


slope 2 2 
2 (Dy — Dm (calc)) (Dm — Dy (calc)) 
x , ; pslope\2 ate ) : 2 
i o(Dy*) M o(D) 


Dy,(calc)are the best-fit model values, which are found for each M by setting 
dz’ /dDy,(calc) = 0, which gives: 
Dy" +Du(o(Dy") /o(Dw)”) 


D; i 
ea TF (DE) /o(Du?) 


(18) 


Since there is only a single parameter to vary (that is, fP*), the minimization of 7? 
converges rapidly starting from any initial guess of f?* between 0 and 1. 

We used trace elements whose crystal/melt partition coefficients between OFB 
or similar melt compositions and olivine, clinopyroxene and plagioclase have been 
determined reasonably robustly at relevant pressures and temperatures (Sup- 
plementary Table 1). The partition coefficients of these 19 elements run from 
the highly incompatible La to the most compatible of the studied trace elements, 
Ni, and include elements concentrated into cpx (heavy REEs, Ti and Zr), plagio- 
clase (Sr and Na) and olivine (Ni) as well as one evenly distributed among the three 
phases (Li). We note that elements of greater incompatibility than La (that is, Ba, 
Nb, Ta, Th, Rb, Cs, W, Cl and Br) are already incorporated in the model under the 
assumption that their bulk Dy, is effectively zero. The selected values of the cry- 
stal/melt partition coefficients are given in Supplementary Table 1. For the least- 
squares fitting, uncertainties were assumed to be +20%, one standard deviation. 
The bulk partition coefficients of several elements are so dominated by one phase, 
that their partition coefficients for the other two phases are inconsequential. 

Supplementary Table 1 also lists the global average slopes and their uncertain- 
ties from the database’® of the selected elements, and the bulk partition coeffi- 
cients from these slopes, Dye calculated from the model with [M], = 10.4 wt%, 
Duo = 1.9, and the minimum limiting slope, d(log[My])/d[ MgO], of —0.26, 
along with propagated uncertainties. 

The least-squares optimization (equation (17)) gives fore = 0.42, with 

fe =041 and f” = 0.18, with y” = 24.6, hence the reduced 7” or MSWD is 1.4 
(18 degrees of freedom), indicating a good fit to the model at the assumed level of 
uncertainty, as shown in Fig. 5. The values of Dy(calc) are where bulk Dy and 
DSPP* would plot on the 1:1 line in Fig. 5. 
Discrepancies and potential problems. Two features of OFB trace-element pat- 
terns are discrepant with accepted partition coefficients. Clinopyroxene fractiona- 
tion should dominate heavy REE patterns. Most studies show values of D&X/™*" 
peaking in the middle of the heavy REE (at about Ho; ref. 26), but the slopes 
d(log[REE])/d{MgO] do not show a maximum there (Fig. 2e). Second, K and 
especially Ba are too compatible in plagioclase for their observed relationships 
with [MgO] to be consistent with plagioclase fractionation (DR/™*" ~ pal/™*" 
=~ 0.2; ref. 28). With these partition coefficients, Ba and K should be less incom- 
patible than La during OFB low-pressure evolution, whatever the model, as long as 
plagioclase is a significant player. 

The systematics of most of the incompatible trace elements in OFB that are not 
completely incompatible is dominated by clinopyroxene crystallization (for 
example, REE, Zr, Hf and Sc). Yet clinopyroxene phenocrysts or micropheno- 
crysts are extremely rare in OFB glasses, whereas plagioclase is ubiquitous and 
olivine very common. The lack of clinopyroxene microphenocrysts is expected 
because the decrease in pressure when liquids are tapped from magma chambers 
moves the liquids off the three-phase cotectic into the ol+plag two-phase field. 
The lack of clinopyroxene phenocrysts might be due to clinopyroxene crystalli- 
zing only on the bottom of the magma chambers, where pressure is greatest, and 
therefore these crystals would not be entrained on eruption. Note that in our 
model, eruption in each cycle follows replenishment and mixing but precedes 
fractional crystallization. This emphasizes that the phenocryst assemblages 
observed in OFB do not reflect at all faithfully the assemblages crystallizing in 
the magma chambers that control their low-pressure evolution. 


©2012 Macmillan Publishers Limited. All rights reserved 


ARTICLE 


doi:10.1038/nature11650 


Analysis of the bread wheat genome using 


whole-genome shotgun sequencing 


Rachel Brenchley’, Manuel Spannag!’, Matthias Pfeifer?, Gary L. A. Barker’, Rosalinda D’Amore!, Alexandra M. Allen®, 
Neil McKenzie*, Melissa Kramer?, Arnaud Kerhornou®, Dan Bolser®, Suzanne Kay‘, Darren Waite*, Martin Trick*, lan Bancroft*, 
Yong Gu’, Naxin Huo’, Ming-Cheng Luo’, Sunish Sehgal’, Bikram Gill’, Sharyar Kianian'°, Olin Anderson’, Paul Kersey’, 

Jan Dvorak®, W. Richard McCombie”, Anthony Hall', Klaus F. X. Mayer’, Keith J. Edwards*, Michael W. Bevan* & Neil Hall! 


Bread wheat (Triticum aestivum) is a globally important crop, accounting for 20 per cent of the calories consumed by 
humans. Major efforts are underway worldwide to increase wheat production by extending genetic diversity and 
analysing key traits, and genomic resources can accelerate progress. But so far the very large size and polyploid 
complexity of the bread wheat genome have been substantial barriers to genome analysis. Here we report the 
sequencing of its large, 17-gigabase-pair, hexaploid genome using 454 pyrosequencing, and comparison of this with 
the sequences of diploid ancestral and progenitor genomes. We identified between 94,000 and 96,000 genes, and 
assigned two-thirds to the three component genomes (A, B and D) of hexaploid wheat. High-resolution synteny 
maps identified many small disruptions to conserved gene order. We show that the hexaploid genome is highly 
dynamic, with significant loss of gene family members on polyploidization and domestication, and an abundance of 
gene fragments. Several classes of genes involved in energy harvesting, metabolism and growth are among expanded 
gene families that could be associated with crop productivity. Our analyses, coupled with the identification of extensive 


genetic variation, provide a resource for accelerating gene discovery and improving this major crop. 


With a global output of 681 million tonnes in 2011', bread wheat 
accounts for 20% of the calories consumed by humans’ and is an 
important source of protein, vitamins and minerals. It originated 
from hybridization between cultivated tetraploid emmer wheat 
(AABB, Triticum dicoccoides) and diploid goat grass (DD, Aegilops 
tauschii) approximately 8,000 years ago’. Bread wheat cultivation 
and domestication has been directly associated with the spread of 
agriculture and settled societies, and it is now one of the most widely 
cultivated crops owing to its high yields and nutritional and pro- 
cessing qualities. The three diploid progenitor genomes, AA from 
Triticum urartu, BB from a species that is unknown but which may 
be of the section Sitopsis (to which Aegilops speltoides belongs), and 
DD from Ae. tauschii, radiated from a common Triticeae ancestor 
between 2.5 and 4.5 million years ago, and AABB tetraploids arose 
less than 0.5 million years ago**. Nucleotide diversity in the AABB 
and DD genomes is substantially reduced compared with ancestral 
populations, indicating a major diversity bottleneck on the transition 
to cultivated lines’. 

Grass genomes show extensive long-range conservation of gene 
order’. Nevertheless, they are highly dynamic owing to the activities 
of repeats that contribute to tremendous variation in genome size’, 
changes in local gene order and pseudogene formation, particularly in 
larger genomes such as those of maize’! and wheat’’. From analysis of 
BAC contigs on chromosome 3B, the 17-gigabase-pair (Gb) genome 
was estimated to be composed of approximately 80% repeats, prim- 
arily retroelements, with a gene density of between 1 per 87 kilobase 
pairs and 1 per 184 kilobase pairs’’. Despite both the substantial 
knowledge gained of the wheat genome from these studies and the 
central importance of the wheat crop, a comprehensive genome-wide 


analysis of gene content has yet to be conducted owing to its large size, 
repeat content and polyploid complexity. 

We have analysed a low-coverage, long-read (454) shotgun 
sequence of the hexaploid wheat genome using gene sequences from 
diverse grasses. From this, we created assemblies of wheat genes in an 
orthologous gene family framework, used diploid wheat relatives to 
classify homeologous relationships, and defined a genome-wide cata- 
logue of single nucleotide polymorphisms (SNPs) in the A, B and 
D genomes. These analyses provide a foundation for genetic and 
genomic analysis of this key crop. 


Sequence analysis 


The wheat variety Chinese Spring (CS42) was selected for sequencing 
because of its wide use in genome studies’*'*. Purified nuclear 
DNA was sequenced using Roche 454 pyrosequencing technology 
(GS FLX Titanium and GS FLX+ platforms) to generate 85 Gb of 
sequence (220 million reads), corresponding to approximately five- 
fold coverage on the basis of an estimated genome size of 17 Gb. 
Supplementary Table 1 shows that 79% of the reads had matches to 
the Triticeae Repeat Sequence Database, and most hit retrotrans- 
posons, consistent with previous studies'’. To identify A-, B- and 
D-genome-derived gene assemblies in the hexaploid sequences, we 
used Illumina sequence assemblies of Triticum monococcum, related 
to the A-genome donor, Ae. speltoides complementary DNA (cDNA) 
assemblies and 454 sequences from the D-genome donor Ae. tauschii, 
respectively. The SOLiD platform was used to generate additional 
sequence of CS42 and three commercial wheat varieties to in- 
crease the accuracy of homeologous SNP identification. Data sets 
are summarized in Table 1 and Supplementary Table 2, and SNP 


1Centre for Genome Research, University of Liverpool, Liverpool L69 7ZB, UK. 2MIPS/IBIS, Helmholtz- Zentrum Munchen, 85764 Neuherberg, Germany. 3School of Biological Sciences, University of Bristol, 
Bristol BS8 1UG, UK. “John Innes Centre, Norwich NR4 7UH, UK. °Cold Spring Harbor Laboratory, Cold Spring Harbor, New York 11724, USA. ®European Bioinformatics Institute, Hinxton CB10 1SD, UK. 
7USDA Western Regional Laboratory, Albany, California 94710, USA. 8Department of Plant Sciences, University of California, Davis, California 95616, USA. 2Department of Plant Pathology, Kansas State 
University, Manhattan, Kansas 66506, USA. 1Department of Plant Sciences, North Dakota State University, Fargo, North Dakota 58018-6050, USA. 


29 NOVEMBER 2012 | VOL 491 | NATURE | 705 


©2012 Macmillan Publishers Limited. All rights reserved 


ARTICLE 


Table 1 | Sequence sources used for analysis 


Genome Platform Size of data set Reference 
T. aestivum (CS42) genomic DNA 454 GS FLX Titanium/454 GS FLX+ 85 Gb EBI study: ERPO00319 
T. aestivum (CS42) genomic DNA from sorted 454 GS FLX Titanium 1A: 287 Mb 1B: 392 Mb 1D: 375 Mb Ref. 12 


chromosomes 1A, 1B and 1D 

T. aestivum (CS42, Avalon, Rialto, Savannah) 
genomic DNA 

T. aestivum (CS42) cDNA 

T.monococcum genomic DNA 


SOLID 3/SOLiD 4 


IIlumina GAIIx/HiSeq 
Ae. speltoides cDNA Pre-assembled data 


454 GS FLX Titanium 
SOLID 4 


Ae. tauschii genomic DNA 
Ae. tauschii genomic DNA 


454 GS FLX Titanium/454 GS FLX+ 


15.2 billion reads EBI study: ERPOO1493 


1.6 Gb EBI study: ERPOO1415 
A/B/D sequences: 3.7 Gb A/B/D NCBI archive: SRPO004490.3 
SNPs: 401 Gb 
151 Mb M. Trick and I. Bancroft, unpublished 
observations 
12.8 Gb M.-C.L. et a/., submitted 


80-100-fold coverage J. Dvorak, unpublished observations 


EBI, European Bioinformatics Institute; NCBI, US National Center for Biotechnology Information. 


identification methods are described in Supplementary Information, 
section 5.2. 


Sequence assembly 


An orthologous group assembly (Supplementary Table 3) was created 
by clustering 454 reads by sequence similarity to orthologous grass 
gene sequences, and separate assembly of the clusters at high strin- 
gency using Newbler (Supplementary Information, section 2). The 
orthologous genes were derived from rice’’, sorghum*, Brachypodium? 
and barley full-length cDNAs by OrthoMCL” clustering. This generated 
20,496 orthologous groups (Supplementary Table 4 and Supplemen- 
tary Fig. 1). The gene model with highest similarity to wheat (termed 
the orthologous group representative (OGR)) was selected from each 
orthologous group by stringent BLASTX comparison to a low-copy- 
number genome assembly (LCG) made by filtering out repetitive 
sequences and assembling the remaining low-copy-number sequences 
de novo (Supplementary Table 3). The assemblies are described in 
Table 2. Nearly 90% of the metabolic genes in Arabidopsis matched 
OGRs, and the 20,051 OGRs matched 92% of publicly available wheat 
full-length cDNAs'* and 78.7% of the harvEST set of wheat cDNA 
assemblies (Supplementary Fig. 2), indicating that they represent 
nearly all wheat genes. 

We optimized parameters for wheat gene assembly using MetaSim”” 
to generate simulated fivefold 454 reads from the allotetraploid maize 
genome and from a triplicated rice gene set, with the introduction of 
sequence variation (Supplementary Information, section 2.7). Similar 
degrees of coverage over the OGRs were seen for the simulated data sets 
and wheat 454 reads (Fig. 1a). Rice reads followed the same depth 
distribution as the wheat reads (Fig. 1b), suggesting that they are a 
reasonable representation of hexaploid sequences. Maize reads covered 
their OGRs to a median depth of approximately five, consistent with 
fivefold coverage. 

Simulated maize and triplicated rice 454 reads were used to optimize 
assembly parameters. Assembly at 99% minimum sequence identity 
(m.i.) using 40-bp overlap length predicted gene family sizes most 
accurately (Supplementary Figs 3-6). Wheat 454reads were pre- 
processed (Supplementary Table 5) and assembled using 99% m.i. 
(Supplementary Tables 6 and 7) to create the orthologous group 
assembly. Figure 1b shows that the depth of coverage of the ortholo- 
gous group assembly followed a similar pattern to maize, consistent 


with multiple gene copies. In contrast, the low depth coverage by the 
LCG assembly suggested that gene family numbers were collapsed. The 
number of wheat assemblies for each OGR was calculated to deter- 
mine gene copy numbers (Supplementary Table 7). Figure 1c shows 
that most OGRs had between one and five distinctive wheat gene 
assemblies, with a peak of two genes. 

The A, B and Ae. tauschii (D) genomes'*”°*! have been estimated to 
contain approximately 28,000, 38,000 and 36,000 genes, respectively. 
We estimated the number of genes in the hexaploid wheat genome to 
range between 94,000 and 96,000 (Supplementary Information, sec- 
tion 2.10). This is reasonably consistent with estimates based on wheat 
chromosome sequences'*. Comparing our transcriptome assembly 
(Supplementary Information, sections 2.8 and 2.9) and wheat 
harvEST with the wheat OGRs showed that 76% and, respectively, 
65% were expressed under the conditions used for RNA isolation. 
Similar results were found in barley”, rice’® and maize”’, indicating 
that the assemblies are bona fide wheat genes. 

We defined the overall extent of gene conservation between wheat 
and the most closely related sequenced pooid grass, Brachypodium 
distachyon”*. Track 1 of Fig. 2 shows that there is a high degree of 
overlap between the gene sets of Brachypodium and wheat, but with 
regions of lower conservation, for example on Brachypodium chro- 
mosomes | and 4. Syntenic maps of the Brachypodium genome and 
the A-, B- and D-chromosome groups were created by integrating 
high-density wheat EST-based markers” with Brachypodium genes 
(Fig. 2, tracks 5, 6 and 7, respectively). Supplementary Fig. 7 shows the 
A-, B- and D-genome markers separately. Syntenic alignments were 
readily identifiable and conformed to the predicted major patterns””*. 
We identified many insertions and/or translocations of blocks of 
genes within the overall conserved patterns of gene order, including 
the major rearrangement on chromosome 4A as shown on Brachypo- 
dium chromosome 1 (ref. 20). Lower marker density on the D genome 
is evident in track 7. The higher-resolution genetic map identified a 
new syntenic alignment of Triticeae group 5 to Brachypodium chro- 
mosome 3 genes. 


Genome change in polyploid wheat 

We determined the influence of polyploidy on gene content in hexa- 
ploid wheat by defining the sizes of gene families in hexaploid wheat 
and the diploid progenitor Ae. tauschii from the copy number of genes 


Table 2 | Assembly statistics of the orthologous group assembly, the LCG and cDNA assemblies 


Orthologous group assembly* (99% m.i.) LCG+ cDNA assembly+ 
Number of sequences 949,279 5,321,847 97,481 
Total sequence (bp) 437,512,281 3,800,325,216 93,340,842 
Minimum length; maximum 79; 7,312 100; 21,721 100; 10,382 
length (bp) 
N10; N50; N90 (bp) 766; 481; 331 2,234; 884; 420 2,107; 1,325; 509 
Mean length (bp) 460.89 714.10 957.53 
GC content (%) 48.25 47.69 47.74 


* Combined set of 454 sequences that cluster and form contigs and 454 sequences that remain singletons. 


+ Set of 454 sequences that cluster and form contigs. 
bp, base pair. 


706 | NATURE | VOL 491 | 29 NOVEMBER 2012 


©2012 Macmillan Publishers Limited. All rights reserved 


OGR (%) 
Median depth 


ARTICLE 


i) 


OGR (%) 
a 


0 20 40 60 80 100 0 20 40 


Coverage of OGR (%) 


Figure 1 | Coverage of OGRs by wheat 454 sequence reads and simulated 
454 reads from rice and maize. a, Coverage of OGRs by repeat-masked wheat 
454 sequence reads (black line), wheat LCG (black dashed line) and the 
orthologous group assembly (blue line), together with rice genes (red line) and 
maize simulated reads (green line). b, Median coverage depth over protein- 


for each OGR, which were then paired with the gene family size of 
the OGR in sequenced diploid grasses (Supplementary Information, 
section 2.6). The mean family size was 1.4 members. Supplementary 
Fig. 8 shows relationships between wheat and diploid orthologous 
gene family across the full scale of orthologous gene family sizes. 
This approach accurately reconstructed gene family sizes in simulated 
maize and ‘hexaploid’ rice genomes (Figs 3a, b), although larger gene 
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Figure 2 | Alignment of wheat 454 reads, SNPs and genetic maps to the 

B. distachyon genome. The inner circle represent gene order on the five 
Brachypodium chromosomes (Bd1-Bd5). Track 1 illustrates conservation 
between wheat 454 reads and Brachypodium genes, shown as a window of genes 
present in wheat. Tracks 2-4 show SNP density (the mean number of SNPs per 
gene in a window of 20 genes) in the A (track 2), B (track 3) and D (track 4) 
genomes of wheat. Tracks 5-7 show wheat synteny with Brachypodium for the 
A (track 5), B (track 6) and D (track7) genomes. Genetic markers” (shown in 
darker colours) are colour-coded by wheat chromosome. Gaps between 
markers are filled in to show synteny (lighter colours). 
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coding regions of OGRs (amino terminus = 0; carboxy terminus = 100). The 
colour coding is the same as in a, except simulated hexaploid reads from rice 
(red line) were used. c, Distribution of wheat gene copy numbers from the 
orthologous group assembly. 


family sizes tended to be underestimated. Figure 3c, d shows the rela- 
tionships between Ae. tauschii and wheat genes. Single-member gene 
families in hexaploid wheat and Ae. tauschii were maintained to a 
similar extent as those seen in sequenced diploid grasses, consistent 
with Southern blot analyses of single-copy genes”. Using the D genome 
as a diploid reference, we calculated the Triticeae hexaploid/diploid 
gene family size ratio to be between 2.5:1 and 2.7:1, derived from the 
geometric mean (2.5:1) and the slopes of the blue line and the red line 
(2.7:1) in Fig. 3e. Comparing this with the expected hexaploid/diploid 
ratio of 3:1 indicates the loss of between 10,000 and 16,000 genes in 
hexaploid wheat compared with the three diploid progenitors (Sup- 
plementary Information, section 2.10). This is consistent with earlier 
studies of gene loss in newly synthesized wheat polyploids* and the 
erosion of genetic diversity during wheat domestication’. 

Despite this overall trend of gene family size reduction, gene 
families with fewer or more members than expected were identified 
in Ae. tauschii and hexaploid wheat, as shown by green dots (more 
members) and brown dots (fewer members) in Fig. 3c (Ae. tauschii) 
and Fig. 3d (hexaploid wheat). Supplementary Tables 10-12 show the 
over- and under-represented functional categories of protein. Most 
of the over-represented categories in expanded gene families are 
common to wheat and Ae. tauschii: these include ribosome proteins, 
components of photosystem II, storage proteins, transposon-related 
proteins, cytochrome P450s, NB-ARC domain proteins involved in 
defence responses, proteins related to pollen allergens and F-box 
proteins. Five of the eleven families encoding hydrogen ion transmem- 
brane transporters were significantly more numerous in Ae. tauschii 
than in wheat. Analysis of gene families (Supplementary Fig. 9) showed. 
that they encode different subunits of ATPases. We speculate that they 
may provide proton gradients to support Na” exclusion in Ae. tauschii® 
and the accumulation of minerals in other Aegilops species”®. 


Pseudogene analysis 


Several classes of plant DNA transposons*’** and retroelements™ 
create and amplify gene fragments, disrupt genes and create pseudo- 
genes, which can influence gene expression through epigenetic 
mechanisms™*. We identified a set of almost 233,000 gene fragments 
that mapped to the same regions of their OGRs, forming ‘stacks’ that 
were sufficiently divergent not to assemble into their cognate gene 
assemblies (Fig. 4a). Two classes were identified: those containing 
Pfam domains and those aligning with non-Pfam domains of OGRs. 
Nearly 30% of the OGRs had associated gene fragments (Supplemen- 
tary Table 13) that most frequently covered between 5 and 15% of the 
OGR length (Fig. 4b). Figure 4c shows that the alignment identities 
of gene fragments against their OGRs were substantially lower than 
the identities of cognate regions within wheat gene assemblies. Sup- 
plementary Fig. 10 shows the distribution of stacks along genes and the 
ratio of non-synonymous to synonymous substitutions (K,/K,) along 
the genes. Pfam domains found in stacks were enriched for zinc-finger 
motifs in mutator transposons (Supplementary Table 14), consistent 
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Figure 3 | Gene family sizes in orthologous assemblies of hexaploid wheat, 
Ae. tauschii, simulated maize and hexaploid rice. ‘The boxes and whiskers 
contain 50% and 90% of the orthologous group assembly genes, respectively. 
The box colours indicate the number of genes in diploid gene families of 
different sizes. The black lines represent expected gene family sizes, and the red 
lines show the gene family sizes determined from the orthologous group 
assembly, derived by polynomial regression fit. Only gene families with up to 
ten members are shown. a, Maize gene family sizes predicted from orthologous 
assembly of simulated 454 reads. b, Rice gene family sizes predicted from 
orthologous assembly of simulated 454 reads derived from triplicated rice 


with their role in pseudogene formation*’. F-box, protein kinase and 
NB-ARC domains, which are found in the most rapidly evolving gene 
families in plants’*°, are also over-represented. 


Determining homeologous relationships of gene 
assemblies 


We classified gene assemblies as A-, B- or D-genome-derived 
according to sequence similarity to Illumina sequence assemblies 
from T. monococcum, cDNA assemblies from Ae. speltoides and, res- 
pectively, 454 reads from Ae. tauschii by applying a support vector 
machine learning approach (Supplementary Section 5, Supplemen- 
tary Figs 11 and 12, and Supplementary Tables 15-18). Supplementary 
Fig. 13 shows that 66% of the gene assemblies were classified with high 
overall precision (>70%) and recall into the A genome (28.3%), the 
B genome (29.2%) and the D genome (33.8%). The other 9% of clas- 
sified assemblies have stop codons. The othes 34% with low classifica- 
tion probabilities are likely to be very similar homeologues. 
Comparison with a subset of A-, B- and D-genome SNPs confirmed 
72% of A-genome classifications and 85% of D-genome classifications 
(Fig. 2 and Supplementary Table 19). Discrimination of putative 
B-genome genes was only ~60%, possibly owing both to the use of 
cDNA sequences for classification when most of the informative 
sequence polymorphisms are intronic, and to uncertainty about the 


a b 


es} 


| 


Identified stacks (%) 
Le) 
ao 


40 


Stack Subassemblies Reads 


12345678910 
Orthologous gene family size 


S 
SOPH HD CPF LSOS 
ve 
Number of orthologous genes 


genes. c, Aegilops tauschii gene family sizes obtained from orthologous 
assembly of repeat-masked 454 reads. Expanded gene families are shown as 
green dots. d, Wheat gene family sizes in the orthologous group assembly. 

e, Amalgamation of wheat and Ae. tauschii gene copy numbers. The black line 
shows the respective expected gene copy numbers for wheat and Ae. tauschii. 
The red line shows the regression fit for wheat, and the blue line shows the 
regression fit for Ae. tauschii. The grey zone between these lines estimates the 
extent of gene loss in hexaploid wheat. For each family size, the left-hand boxes 
represent hexaploid wheat and the right-hand boxes represent Ae. tauschii. 


ancestry of the B genome’. The set of 132,552 SNPs allocated to the A, 
B and D genomes is displayed using Brachypodium as a template in 
tracks 2-4 of Fig. 2. 

There were no significant differences between the respective distri- 
butions of GO Slim molecular function categories in the A, B and 
D genes (Supplementary Fig. 14), indicating that at this level of func- 
tional categorization there is no biased gene loss” in any of the genomes. 
Nevertheless, analysis of GO Slim terms associated with stop codons in 
A, B and D gene assemblies showed that there was a strong tendency to 
retain functional copies of genes encoding transcription factors in all 
three genomes (Supplementary Fig. 15), similar to the preferential 
retention of these genes in Arabidopsis genome duplications’’. This 
indicates that genome-specific transcriptional regulatory networks 
tend to be maintained in wheat. 


Conclusions 


Using whole-genome 454 sequencing, we assembled gene sequences 
representing an essentially complete gene set, and a significant number 
were assigned to the A, B or D genome. Although the assemblies are 
fragmentary, they form a powerful framework for identifying genes, 
accelerating further genome sequencing and facilitating genome-scale 
analyses. The identification of over 132,000 SNPs in A, B and D genes 
facilitates analysis of quantitative trait loci and association studies of 
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Figure 4 | Pseudogene identification and analysis. a, Visualization of an 
OGR and associated wheat sequences. The top track shows the hit count profile 
of mapped 454 reads. The lower tracks show subassemblies of three wheat genes 
and a stacked region of gene fragments. Read depth is represented by the heat 
map. b, Coverage of the OGR by Pfam-containing gene fragments and 


708 | NATURE | VOL 491 | 29 NOVEMBER 2012 


Coverage of OGR (%) 


20 30 40 70 75 80 85 90 95 100 


Alignment identity (%) 


pseudogenes. The blue and red lines represent stacks with and without protein 
domains, respectively. c. Protein identity between subassemblies forming stacks 
of gene fragments. The blue and red lines represent stacks with and without 
protein domains, respectively, and the black line represents subassemblies 
forming genes. 
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traits. Comparison with the sequences of diploid progenitors and rela- 
tives showed pronounced reductions in the size of large gene families in 
wheat despite the relatively recent formation of the hexaploid (Fig. 3e), 
consistent with smaller-scale analyses*”*. The scale of gene loss in 
hexaploid wheat compared with maize*® and Brassica rapa’ is signifi- 
cantly smaller, possibly as a result of its relatively recent origin and the 
absence of intergenome recombination*’. Nevertheless, gene loss in 
wheat could be rapid, as shown in the newly created allopolyploid 
Tragopogon miscellus*’. Most functional classes show equal gene loss 
in the three genomes, but families of transcription factors showed a 
clear tendency to be retained as functional genes in all three genomes. 
These may maintain transcriptional networks in each genome and 
contribute to non-additive gene expression” and genome plasticity. 
In contrast to the overall loss of gene family members, several classes of 
gene families with predicted roles in defence, nutritional content, 
energy metabolism and growth have increased sizes in the Triticeae 
lineage, possibly as a result of selection during domestication. 

Major efforts are underway to improve wheat productivity by 
increasing genetic diversity in breeding materials and through genetic 
analysis of traits’. The genomic resources that we have developed 
promise to accelerate progress by facilitating the identification of useful 
variation in genes of wheat landraces and progenitor species, and by 
providing genomic landmarks to guide progeny selection. Analysis of 
complex polygenic traits such as yield and nutrient use efficiency will 
also be accelerated, contributing to sustainable increases in wheat crop 
production. 


METHODS SUMMARY 


A single-seed descent line of T. aestivum landrace Chinese Spring was sequenced, 
because it is widely used for cytogenetic analysis** and physical mapping’. 
Triticum monococcum accession 4342-96 is a community standard line for tar- 
geting induced local lesions in genomes, physical mapping and genetic analysis; 
and Ae. tauschii ssp strangulata accession AL8/78, which is used for physical and 
genetic mapping, was sequenced using 454 technology. 

Sequence for the T. aestivum wheat gene assembly was generated using Roche 
454 pyrosequencing on the GS FLX Titanium and GS FLX+ platforms. Additional 
sequence read data sets for T. aestivum, T.monococcum and Ae. tauschii were 
generated using three platforms, Illumina, 454 and SOLiD, to analyse home- 
ologous sequences and SNPs (a list of all data sets is in Supplementary Table 2). 
Orthologous groups were created from rice, sorghum and B. distachyon genome 
sequences and barley full-length cDNA sequences. Wheat gene assemblies were 
named according to their OGR and were identified by a seven-digit identifier and 
their predicted genome (for example Traes_Bradilg12345_0000001_D and 
Traes_Sb3g33333_6543210_A). Gene and cDNA assemblies can be searched at 
the MIPS Wheat Genome Database (http://mips.helmholtz-muenchen.de/plant/ 
wheat/uk454survey/index.jsp). All sequence data has been deposited in publicly 
accessible databases, described in Supplementary Information. Sequence assem- 
blies, annotated gene sequences and their relationships are available for download 
from the European Bioinformatics Institute (www.ebi.ac.uk) and viewing in a 
synteny-based Ensembl genome browser. Annotated gene sequences and their 
relationships can be viewed in a Brachypodium synteny-based Ensembl genome 
browser (http://plants.ensembl.org/brachypodium_distachyon). 
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A physical, genetic and functional 
Sequence assembly of the barley genome 


The International Barley Genome Sequencing Consortium* 


Barley (Hordeum vulgare L.) is among the world’s earliest domesticated and most important crop plants. It is diploid 
with a large haploid genome of 5.1 gigabases (Gb). Here we present an integrated and ordered physical, genetic and 
functional sequence resource that describes the barley gene-space in a structured whole-genome context. We developed 
a physical map of 4.98 Gb, with more than 3.90 Gb anchored to a high-resolution genetic map. Projecting a deep 
whole-genome shotgun assembly, complementary DNA and deep RNA sequence data onto this framework supports 
79,379 transcript clusters, including 26,159 ‘high-confidence’ genes with homology support from other plant genomes. 
Abundant alternative splicing, premature termination codons and novel transcriptionally active regions suggest that 
post-transcriptional processing forms an important regulatory layer. Survey sequences from diverse accessions reveal a 
landscape of extensive single-nucleotide variation. Our data provide a platform for both genome-assisted research and 


enabling contemporary crop improvement. 


Cultivated barley, derived from its wild progenitor Hordeum vulgare 
ssp. spontaneum, is among the world’s earliest domesticated crop 
species and today represents the fourth most abundant cereal in both 
area and tonnage harvested (http://faostat.fao.org). Approximately 
three-quarters of global production is used for animal feed, 20% is 
malted for use in alcoholic and non-alcoholic beverages, and 5% as an 
ingredient in a range of food products’. Barley is widely adapted to 
diverse environmental conditions and is more stress tolerant than its 
close relative wheat’. As a result, barley remains a major food source in 
poorer countries*, maintaining harvestable yields in harsh and marginal 
environments. In more developed societies it has recently been classified 
as a true functional food. Barley grain is particularly high in soluble 
dietary fibre, which significantly reduces the risk of serious human dis- 
eases including type II diabetes, cardiovascular disease and colorectal 
cancers that afflict hundreds of millions of people worldwide’. The USA 
Food and Drug Administration permit a human health claim for cell- 
wall polysaccharides from barley grain. 

As a diploid, inbreeding, temperate crop, barley has traditionally 
been considered a model for plant genetic research. Large collections 
of germplasm containing geographically diverse elite varieties, land- 
races and wild accessions are readily available® and undoubtedly con- 
tain alleles that could ameliorate the effect of climate change and 
further enhance dietary fibre in the grain. Enriching its broad natural 
diversity, extensive characterized mutant collections containing all 
of the morphological and developmental variation observed in the 
species have been generated, characterized and meticulously main- 
tained. The major impediment to the exploitation of these resources 
in fundamental and breeding science has been the absence ofa reference 
genome sequence, or an appropriate enabling alternative. Providing 
either of these has been the primary research challenge to the global 
barley community. 

In response to this challenge, we present a novel model for deliver- 
ing the genome resources needed to reinforce the position of barley as 
a model for the Triticeae, the tribe that includes bread and durum 
wheats, barley and rye. We introduce the barley genome gene space, 
which we define as an integrated, multi-layered informational 
resource that provides access to the majority of barley genes in a 


highly structured physical and genetic framework. In association with 
comparative sequence and transcriptome data, the gene space provides 
a new molecular and cellular insight into the biology of the species, 
providing a platform to advance gene discovery and genome-assisted 
crop improvement. 


A sequence-enriched barley physical map 

We constructed a genome-wide physical map of the barley cultivar 
(cv.) Morex by high-information-content fingerprinting’ and contig 
assembly® of 571,000 bacterial artificial chromosome (BAC) clones 
(~14-fold haploid genome coverage) originating from six indepen- 
dent BAC libraries’. After automated assembly and manual curation, 
the physical map comprised 9,265 BAC contigs with an estimated 
N50 contig size of 904 kilobases and a cumulative length of 4.98 Gb 
(Methods, Supplementary Note 2). It is represented by a minimum 
tiling path (MTP) of 67,000 BAC clones. Given a genome size of 
5.1 Gb"°, more than 95% of the barley genome is represented in the 
physical map, comparing favourably to the 1,036 contigs that represent 
80% of the 1 Gb wheat chromosome 3B". 

We enhanced the physical map by integrating shotgun sequence 
information from 5,341 gene-containing’*”* and 937 randomly selected 
BAC clones (Methods, Supplementary Notes 2 and 3, and Sup- 
plementary Table 4), and 304,523 BAC-end sequence (BES) pairs 
(Supplementary Table 3). These provided 1,136 megabases (Mb) of 
genomic sequence integrated directly into the physical map (Sup- 
plementary Tables 3 and 4). This framework facilitated the incorpora- 
tion of whole-genome shotgun sequence data and integration of the 
physical and genetic maps. We generated whole-genome shotgun 
sequence data from genomic DNA of cv. “Morex’ by short-read 
Illumina GAIIx technology, using a combination of 300 base pairs 
(bp) paired-end and 2.5kb mate-pair libraries, to >50-fold haploid 
genome coverage (Supplementary Note 3.3). De novo assembly resulted 
in sequence contigs totalling 1.9Gb. Due to the high proportion of 
repetitive DNA, a substantial part of the whole-genome shotgun data 
collapsed into relatively small contigs characterized by exceptionally 
high read depths. Overall, 376,261 contigs were larger than 1 kb 
(N50 = 264,958 contigs, N50 length = 1,425 bp). Of these, 112,989 


*A list of authors and their affiliations appears at the end of the paper. 
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(308 Mb) could be anchored directly to the sequence-enriched physical 
map by sequence homology. 

We implemented a hierarchical approach to further anchor the 
physical and genetic maps (Methods, Supplementary Note 4). A total 
of 3,241 genetically mapped gene-based single-nucleotide variants 
(SNV) and 498,165 sequence-tag genetic markers’* allowed us to 
use sequence homology to assign 4,556 sequence-enriched physical 
map contigs spanning 3.9 Gb to genetic positions along each barley 
chromosome. An additional 1,881 contigs were assigned to chromo- 
somal bins by sequence homology to chromosome-arm-specific 
sequence data sets’? (Supplementary Note 4.4). Thus, 6,437 physical 
map contigs totalling 4.56 Gb (90% of the genome), were assigned to 
chromosome arm bins, the majority in linear order. Non-anchored 
contigs were typically short and lacked genetically informative 
sequences required for positional assignment. 

Consistent with genome sequences of other grass species’® the peri- 
centromeric and centromeric regions of barley chromosomes exhibit 
significantly reduced recombination frequency, a feature that com- 
promises exploitation of genetic diversity and negatively impacts 
genetic studies and plant breeding. Approximately 1.9 Gb or 48% of 
the genetically anchored physical map (3.9 Gb) was assigned to these 
regions (Fig. 1 and Supplementary Fig. 11). 


Repetitive nature of the barley genome 


A characteristic of the barley genome is the abundance of repetitive 
DNA”. We observed that approximately 84% of the genome is com- 
prised of mobile elements or other repeat structures (Supplementary 


= 


Legend to heat map: 


Track b genes per cM min: 0, max: 80 men 
Track d genes per Mb min: 0, max: 20 

Track e LTR retroelements (%) min: 0, max: 100 

Track f DNA transposons (%) min: 0, max: 20 min 


Track g sequenced BAC clones per Mb 0 | 20 


Figure 1 | Landscape of the barley gene space. Track a gives the seven barley 
chromosomes. Green/grey colour depicts the agreement of anchored 
fingerprint (FPC) contigs with their chromosome arm assignment based on 
chromosome-arm-specific shotgun sequence reads (for further details see 
Supplementary Note 4). For 1H only whole-chromosome sequence assignment 
was available. Track b, distribution of high-confidence genes along the genetic 
map; track c, connectors relate gene positions between genetic and the 
integrated physical map given in track d. Position and distribution of track 

e class I LTR-retroelements and track f class II DNA transposons are given. 
Track g, distribution and positioning of sequenced BACs. 
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Note 5). The majority (76% in random BACs) of these consists of 
retrotransposons, 99.6% of which are long terminal repeat (LTR) 
retrotransposons. The non-LTR retrotransposons contribute only 
0.31% and the DNA transposons 6.3% of the random BAC sequence. 
In the fraction of the genome with a high proportion of repetitive 
elements, the LTR Gypsy retrotransposon superfamily was 1.5-fold 
more abundant than the Copia superfamily, in contrast to observa- 
tions in both Brachypodium* and rice’’. However, gene-bearing 
BACs were slightly depleted of retrotransposons, consistent with 
Brachypodium'* where young Copia retroelements are preferentially 
found in gene-rich, recombinogenic regions from which inactive 
Gypsy retroelements have been lost by LTR-LTR recombination. 
Overall, we see reduced repetitive DNA content within the terminal 
10% of the physical map of each barley chromosome arm (Fig. 1). 
Class I and II elements show non-quantitative reverse-image distri- 
bution along barley chromosomes (Fig. 1), a feature shared with other 
grass genomes'®”° and shown by fluorescence in situ hybridization 
(FISH) mapping’. Not surprisingly, the whole-genome shotgun 
assembly shows a lower abundance of LTR retrotransposons (average 
53%) than gene-bearing BACs. That LTR retrotransposons are long 
(~10kb), highly repetitive and often nested*’ supports our assump- 
tion that short reads either collapsed or did not assemble. Short inter- 
spersed elements (SINEs)”’, short (80-600 bp) non-autonomous 
retrotransposons that are highly repeated in barley, showed no dif- 
ferential exclusion from the assemblies. However, miniature inverted- 
repeat transposable elements (MITEs), small non-autonomous DNA 
transposons”, were twofold enriched in the whole-genome shotgun 
assemblies compared with BES reads or random BACs, consistent 
with the gene richness of the assemblies and their association with 
genes’. Both MITEs and SINEs are 1.5 to 2-fold enriched in gene- 
bearing BACs which could indicate that SINEs are also preferentially 
integrated into gene-rich regions, or because they are older than LTR 
retroelements, may simply remain visible in and around genes where 
retro insertions have been selected against. 


Transcribed portion of the barley genome 


The transcribed complement of the barley gene space was annotated 
by mapping 1.67 billion RNA-seq reads (167 Gb) obtained from eight 
stages of barley development as well as 28,592 barley full-length 
cDNAs” to the whole-genome shotgun assembly (Methods, Sup- 
plementary Notes 6, 7 and Supplementary Tables 20-22). Exon detec- 
tion and consensus gene modelling revealed 79,379 transcript clusters, 
of which 75,258 (95%) were anchored to the whole-genome shotgun 
assembly (Supplementary Notes 7.1.1 and 7.1.2). Based on a gene- 
family-directed comparison with the genomes of Sorghum, rice, 
Brachypodium and Arabidopsis, 26,159 of these transcribed loci fall 
into clusters and have homology support to at least one reference 
genome (Supplementary Fig. 16); they were defined as high-confidence 
genes. Comparison against a data set of metabolic genes in Arabidopsis 
thaliana’ indicated a detection rate of 86%, allowing the barley gene- 
set to be estimated as approximately 30,400 genes. Due to lack of 
homology and missing support from gene family clustering, 53,220 
transcript loci were considered low-confidence (Table 1). High- 
confidence and low-confidence barley genes exhibited distinct charac- 
teristics: 75% of the high-confidence genes had a multi-exon structure, 
compared with only 27% of low-confidence genes (Table 1). The mean 
size of high-confidence genes was 3,013 bp compared with 972 bp for 
low-confidence genes. A total of 14,481 low-confidence genes showed 
distant homology to plant proteins in public databases (Supplementary 
Notes 7.1.2, 7.1.4 and Supplementary Fig. 18), identifying them as 
potential gene fragments known to populate Triticeae genomes at high 
copy number and that often result from transposable element activity”®. 

A total of 15,719 high-confidence genes could be directly associated 
with the genetically anchored physical map (Supplementary Note 4). 
An additional 3,743 were integrated by invoking a conservation of syn- 
teny model (Supplementary Note 4.5) and a further 4,692 by association 
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Table 1 | Characteristics of high-confidence and low-confidence 
gene sets in barley 


High confidence Low confidence 


Number of genes 26,159 53,220 
Gene loci positioned on barley cultivar 24,243 (93%) 51,015 (96%) 
Morex assembly* 

Single exon 5,954 (25%) 37,395 (73%) 
ulti exon 18,289 (75%) 13,620 (27%) 
umber of distinct exonst 184,710 107,768 
ean number of distinct exons per gene 7.62 2.11 
umber of genes with alternative 13,299 (55%) 8,214 (16%) 

transcript variants 

Total number of predicted transcripts 62,426 69,266 
ean number of transcripts per gene 2.58 1.36 
ean gene locus size (first to last exon) 3,013 bp 972 bp 
ean transcript size (UTR, CDS) 1,878bp 931 bp 
ean exon size 454 bp 536 bp 

Gene loci not positioned on barley cv. 1,916 (7%) 2,205 (4%) 
Morex assemblyt 

Tagged by unmapped RNA-seq reads 1,657 (86%) 1,127 (51%) 

ot tagged by unmapped RNA-seq reads 259 (14%) 1,078 (49%) 


* Gene locus representatives are (1) RNA-seq based transcript or (2) barley fl-cDNA that were mapped 
to the barley cultivar Morex assembly or tagged by RNA-seq based transcript during clustering. 

+ Exons of two or more transcripts were counted once if they have identical start and stop positions. 
{Gene locus representatives are barley fl-cDNAs that were not mapped to the barley cultivar Morex 
assembly and not matched by any RNA-seq based transcript 

CDS, coding sequence. 


with chromosome arm whole-genome shotgun data (Supplementary 
Note 4.4 and Supplementary Table 15). Importantly, the N50 length 
of whole-genome shotgun sequence contigs containing high-confidence 
genes was 8,172 bp, which is generally sufficient to include the entire 
coding sequence, and 5’ and 3’ untranslated regions (UTRs). Overall 
24,154 high-confidence genes (92.3%) were associated and positioned in 
the physical/genetic scaffold, representing a gene density of five genes 
per Mb. Proximal and distal ends of chromosomes are more gene-rich, 
on average containing 13 genes per Mb (Fig. 1). 

In comparison with sequenced model plant genomes, gene family 
analysis (Supplementary Note 7.1.3) revealed some gene families that 
exhibited barley-specific expansion. We defined the functions of 
members of these families using gene ontology (GO) and PFAM 
protein motifs (Supplementary Table 25). Gene families with highly 
overrepresented GO/PFAM terms included genes encoding (1,3)-B- 
glucan synthases, protease inhibitors, sugar-binding proteins and 
sugar transporters. NB-ARC (a nucleotide-binding adaptor shared 
by APAF-1, certain R gene products and CED-4’’) domain proteins, 
known to be involved in defence responses, were also overrepre- 
sented, including 191 NBS-LRR type genes. These tended to cluster 
towards the distal regions of barley chromosomes (Supplementary 
Fig. 17), including a major group on barley chromosome 1HS, co- 
localizing with the MLA powdery mildew resistance gene cluster’*. 
Biased allocation to recombination-rich regions provides the genomic 
environment for generating sequence diversity required to cope with 
dynamic pathogen populations””®. It is noteworthy that the highly 
over-represented (1,3)-B-glucan synthase genes have also been impli- 
cated in plant-pathogen interactions*’. 


Regulation of gene expression 


Deep RNA sequence data (RNA-seq) provided insights into the spatial 
and temporal regulation of gene expression (Supplementary Note 7.2). 
We found 72-84% of high-confidence genes to be expressed in all spa- 
tiotemporal RNA-seq samples (Fig. 2a), slightly lower than reported 
for rice** where ~95% of transcripts were found in more than one 
developmental or tissue sample. More importantly, 36-55% of high- 
confidence barley genes seemed to be differentially regulated between 
samples (Fig. 2b), highlighting the inherent dynamics of barley gene 
expression. 

Two notable features support the importance of post-transcriptional 
processing as a central regulatory layer (Supplementary Notes 7.3 and 
7.4). First, we observed evidence for extensive alternative splicing. Of 
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the intron-containing high-confidence barley genes, 73% had evidence 
of alternative splicing (55% of the entire high-confidence set). The 
spatial and temporal distribution of alternative splicing transcripts 
deviated significantly from the general occurrence of transcripts in 
the different tissues analysed (Fig. 2c). Only 17% of alternative splicing 
transcripts were shared among all samples, and 17-27% of the alterna- 
tive splicing transcripts were detected only in individual samples, 
indicating pronounced alternative splicing regulation. We found 
2,466 premature termination codon-containing (PTC+) alterna- 
tive splicing transcripts (9.4% of high-confidence genes) (Fig. 2d 
and Table 2), similar to the percentage of nonsense-mediated decay 
(NMD)-controlled genes in a wide range of species****. Premature 
termination codons activate the NMD pathway”, which leads to rapid 
degradation of PTC+ transcripts, and have been associated with trans- 
criptional regulation during disease and stress response in human and 
Arabidopsis, respectively’**°°. The distribution of PTC+ transcripts 
was strikingly dissimilar, both spatially and temporally, with only 7.4% 
shared and between 31% and 40% exclusively observed in only a 
single sample (Fig. 2d). Genes encoding PTC-+ -containing transcripts 
show a broad spectrum of GO terms and PFAM domains and are more 
prevalent in expanded gene families. These observations support a 
central role for alternative splicing/NMD-dependent decay of PTC+ 
transcripts as a mechanism that controls the expression of many dif- 
ferent barley genes. 

Second, recent reports have highlighted the abundance of novel 
transcriptionally active regions in rice that lack homology to protein- 
coding genes or open reading frames (ORFs)*°. In barley as many as 
27,009 preferentially single-exon low-confidence genes can be classified 
as putative novel transcriptionally active regions (Supplementary Note 
7.1.4). We investigated their potential significance by comparing the 
homology of barley novel transcriptionally active regions with the rice 
and Brachypodium genomes that respectively represent 50 and 30 
million years of evolutionary divergence’*. A total of 4,830 and 2,450 
novel transcriptionally active regions yielded a homology match to the 
Brachypodium and rice genomes, respectively (intersection of 2,046; 
BLAST P value = 10 °), indicating a putative functional role in pre- 
mRNA processing or other RNA regulatory processes**’. 


Natural diversity 


Barley was domesticated approximately 10,000 years ago’. Extensive 
genotypic analysis of diverse germplasm has revealed that restricted 
outcrossing (0-1.8%)*, combined with low recombination in peri- 
centromeric regions, has resulted in modern germplasm that shows 
limited regional haplotype diversity“. We investigated the frequency 
and distribution of genome diversity by survey sequencing four 
diverse barley cultivars (“Bowman’, ‘Barke’, ‘Igri’ and “Haruna Nijo’) 
and an H. spontaneum accession (Methods and Supplementary Note 8) 
to a depth of 5-25-fold coverage, and mapping sequence reads against 
the barley cultivar ‘Morex’ gene space. We identified more than 15 
million non-redundant single-nucleotide variants (SNVs). H. sponta- 
neum contributed almost twofold more SNV than each of the cultivars 
(Supplementary Table 28). Up to 6 million SNV per accession could be 
assigned to chromosome arms, including up to 350,000 associated with 
exons (Supplementary Table 29). Approximately 50% of the exon- 
located SNV were integrated into the genetic/physical framework 
(Fig. 3, Supplementary Table 30 and Supplementary Fig. 31), providing 
a platform to establish true genome-wide marker technology for high- 
resolution genetics and genome-assisted breeding. 

We observed a decrease in SNV frequency towards the centromeric 
and peri-centromeric regions of all barley chromosomes, a pattern 
that seemed more pronounced in the barley cultivars. This trend was 
supported by SNV identified in RNA-seq data from six additional 
cultivars mapped onto the Morex genomic assembly (Supplemen- 
tary Note 8.2). We attribute this pattern of eroded genetic diversity 
to low recombination in the pericentromeric regions, which reduces 
effective population size and consequently haplotype diversity. Whereas 
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Figure 2 | Atlas of barley gene expression. a, Barley gene expression in 
different spatial and temporal RNA-seq samples (Supplementary Notes 6, 7). 
Numbers refer to high-confidence genes. b, Dendrogram depicting relatedness 
of samples and colour-coded matrix showing number of significantly 
upregulated high-confidence genes in pairwise comparisons. ©, total number of 
non-redundant high-confidence genes upregulated in comparison to all other 


Table 2 | Alternative splicing and transcripts containing PTCs in 
high-confidence genes 


General statistics of alternative splicing in high-confidence genes 


High-confidence genes with RNA-seq data to monitor 24,243 

alternative splicing 

Predicted transcripts at high-confidence genes 62,426 
Transcripts with complete CDS structures* 62,256 
Transcripts with partial CDS structurest 170 

Genes with alternative transcripts 13,299 

Predicted transcripts derived from genes with 51,482 

alternative splicing 

Premature stop codon analysis 

Predicted transcripts used for PTC analysist 51,338 

Transcripts without PTC 41,461 (81%) 

Transcripts containing PTC 9,877 
PTC caused by intron retention 5,286 (10%) 
PTC-+ transcripts predicted to be NMD****-sensitive 4,591 (9%) 

Gene loci incorporating PTC+/NMD transcripts 2,466 


* Entire predicted coding sequence (100%) was transferred to transcript model on barley cultivar 
Morex contigs. 

+ Predicted coding sequence could not be completely projected to genomic transcript model (partial 
mapping of fl-cDNA). 

{Only transcripts with structures for entire coding sequence on barley cultivar Morex WGS assembly 
were considered. 

CDS, coding sequence. 
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samples. Height, complete linkage cluster distance (log>(fragments per kilobase 
of exon per million fragments mapped)); see Supplementary Note 7.2.5.1. 

c, Distribution and overlap of alternatively spliced barley transcripts between 
RNA-seq samples. d, Distribution and overlap of alternative splicing transcripts 
fulfilling criteria for PTC+ as detected in different spatial and temporal RNA- 
seq samples (Supplementary Note 7.4). 


H. spontaneum may serve here as a reservoir of genetic diversity, using 
this diversity may itself be compromised by restricted recombination 
and the consequent inability to disrupt tight linkages between desirable 
and deleterious alleles. Surprisingly, the short arm of chromosome 4H 
had a significantly lower SNV frequency than all other barley chromo- 
somes (Supplementary Fig. 33). This may be a consequence of a further 
reduction in recombination frequency on this chromosome, which is 
genetically (but not physically) shortest. Reduced SNV diversity was also 
observed in regions we interpret to be either the consequences of recent 
breeding history or could indicate landmarks of domestication (Fig. 3). 


Discussion 


The size of Triticeae cereal genomes, due to their highly repetitive 
DNA composition, has severely compromised the assembly of whole- 
genome shotgun sequences and formed a barrier to the generation of 
high-quality reference genomes. We circumvented these problems by 
integrating complementary and heterogeneous sequence-based geno- 
mic and genetic data sets. This involved coupling a deep physical map 
with high density genetic maps, superimposing deep short-read whole- 
genome shotgun assemblies, and annotating the resulting linear, albeit 
punctuated, genomic sequence with deep-coverage RNA-derived data 
(full-length cDNA and RNA-seq). This allowed us to systematically 
delineate approximately 4 Gb (80%) of the genome, including more 
than 90% of the expressed genes. The resulting genomic framework 
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Figure 3 | Single nucleotide variation (SNV) frequency in barley. Barley 
chromosomes indicated as inner circle of grey bars. Connector lines give the 
genetic/physical relationship in the barley genome. SNV frequency distribution 
displayed as five coloured circular histograms (scale, relative abundance of 
SNVs within accession; abundance, total number of SNVs in non-overlapping 
50-kb intervals of concatenated ‘Morex’ genomic scaffold; range, zero to 
maximum number of SNVs per 50-kb interval). Selected patterns of SNV 
frequency indicated by coloured arrowheads (for further details see 
Supplementary Note 8). Colouring of arrowheads refers to cultivar with 
deviating SNV frequency for the respective region. 


provides a detailed insight into the physical distribution of genes and 
repetitive DNA and how these features relate to genetic characteristics 
such as recombination frequency, gene expression and patterns of 
genetic variation. 

The centromeric and peri-centromeric regions of barley chromo- 
somes contain a large number of functional genes that are locked into 
recombinationally ‘inert’ genomic regions*°. The gene-space distri- 
bution highlights that these regions expand to almost 50% of the 
physical length of individual chromosomes. Given well-established 
levels of conserved synteny, this will probably be a general feature of 
related grass genomes that will have important practical implications. 
For example, infrequent recombination could function to maintain 
evolutionarily selected and co-adapted gene complexes. It will certainly 
restrict the release of the genetic diversity required to decouple advant- 
ageous from deleterious alleles, a potential key to improving genetic 
gain. Understanding these effects will have important consequences 
for crop improvement. Moreover, for gene discovery, forward genetic 
strategies based on recombination will not be effective in these regions. 
Whereas alternative approaches exist for some targets (for example, 
by coupling resequencing technologies with collections of natural or 
induced mutant alleles), for most traits it remains a serious impedi- 
ment. Some promise may lie in manipulating patterns of recombina- 
tion by either genetic or environmental intervention’’. Quite strikingly, 
our data also reveal that a complex layer of post-transcriptional regu- 
lation will need to be considered when attempting to link barley genes 
to functions. Connections between post-transcriptional regulation 
such as alternative splicing and functional biological consequences 
remain limited to a few specific examples”, but the scale of our obser- 
vations suggest this list will expand considerably. 

In conclusion, the barley gene space reported here provides an 
essential reference for genetic research and breeding. It represents a 


ARTICLE 


hub for trait isolation, understanding and exploiting natural genetic 
diversity and investigating the unique biology and evolution of one of 
the world’s first domesticated crops. 


METHODS SUMMARY 


Methods are available in the online version of the paper. 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 


Building the physical map. BAC clones of six libraries of cultivar “Morex”””” were 
analysed by high information content fingerprinting (HICF)’”. A total of 571,000 
edited profiles was assembled using FPC v9.2° (Supplementary Table 2) (Sulston 
score threshold of 107°, tolerance = 5, tolerated Q clones = 10%). Nine iterative 
automated re-assemblies were performed at successively reduced stringency 
(Sulston score of 10” *° to 10-*°). A final step of manual merging of FPC contigs 
was performed at lower stringency (Sulston score threshold 10°) considering 
genetic anchoring information for markers with a genetic distance = + 5 cM. 
This produced 9,265 FPcontigs (approximately 14-fold haploid genome cov- 
erage) (Supplementary Table 2). 

Genomic sequencing. BAC-end sequencing (BES). BAC insert ends were 
sequenced using Sanger sequencing (Supplementary Note 2.1). Vector and quality 
trimming of sequence trace files was conducted using LUCY™ (http://www. jcvi.org/ 
cms/research/software/). Short reads (that is, < 100 bp) were removed. Organellar 
DNA and barley pathogen sequences were filtered by BLASTN comparisons to 
public sequence databases (http://www.ncbi.nlm.nih.gov/). 

BAC shotgun sequencing (BACseq). Seed BACs of the FPC map were sequenced 
to reveal gene sequence information for physical map anchoring. 4,095 BAC clones 
were shotgun sequenced in pools of 2 X 48 individually barcoded BACs on Roche/ 
454 GS FLX or FLX Titanium’'”’. Sequences were assembled using MIRA v3.2.0 
(http://www.chevreux.org/projects_mira.html) at default parameters with features 
‘accurate’, ‘454’, ‘genome’, ‘denovo’. An additional 2,183 gene-bearing BACs (Sup- 
plementary Note 3.2) were sequenced using Illumina HiSeq 2000 in 91 combin- 
atorial pools’*. Deconvoluted reads were assembled using VELVET’. Assembly 
statistics are given in Supplementary Table 4. 

Whole-genome shotgun sequencing. Illumina paired-end (PE; fragment size 
~350 bp) and mate-pair (MP; fragment size ~2.5 kb) libraries were generated 
from fragmented genomic DNA™ of different barley cultivars (“Morex’, ‘Barke’, 
‘Bowman’, ‘Igri’) and an S3 single-seed selection of a wild barley accession B1K- 
04-12° (Hordeum vulgare ssp. spontaneum). Libraries were sequenced by 
Illumina GAIIx and Hiseq 2000. Genomic DNA of cultivar “Haruna Nijo’ (size range 
of 600-1,000 bp) was sequenced using Roche 454 GSFLX Titanium chemistry. 
Whole-genome shotgun sequence assembly. PE and MP whole-genome shot- 
gun libraries were calibrated for fragment sizes by mapping pairs against the 
chloroplast sequence of barley (NC_008590) using BWA”. Sequences were qual- 
ity trimmed and de novo assembled using CLC Assembly Cell v3.2.2 (http:// 
www.clcbio.com/). Independent de novo assemblies were performed from data 
of cultivars ‘Morex’, ‘Bowman’ and ‘Barke’. 

Transcriptome sequencing. Eight tissues of cultivar “Morex’ (three biological 
replications each) earmarking stages of the barley life cycle from germinating 
grain to maturing caryopsis were selected for deep RNA sequencing (RNA-seq). 
Plant growth, sampling and sequencing is detailed in Supplementary Information 
(Supplementary Note 6). Further mRNA sequencing data was generated from 
eight additional spring barley cultivars within a separate study and was used here 
for sequence diversity analysis (Supplementary Note 8.2). 

Genetic framework of the physical map. The genetic framework for anchoring 
the physical map of barley was built on a single-nucleotide variation (SNV) map” 
(Supplementary Note 4.3) which provided the highest marker density (3,973) and 
resolution (N = 360, RIL/F8) for a single bi-parental mapping population in 
barley. Additional high-density genetic marker maps (Supplementary Note 4.3) 
were compared and aligned on the basis of shared markers. Furthermore, we used 
genotyping-by-sequencing (GBS)°* to generate high-density genetic maps com- 
prising 34,396 SNVs and 21,384 SNVs as well as 241,159 and 184,796 dominant 
(presence/absence) tags for the two doubled haploid populations Oregon Wolfe 
Barley’ and Morex X Barke”, respectively. Altogether 498,165 marker sequence 
tags were used (Supplementary Table 11). 

Genetic anchoring. Genetic integration of the physical map involved procedures 
of direct and indirect anchoring. 

Direct anchoring. Genetic markers were assigned to BAC clones/BAC contigs 
by three different procedures (Supplementary Note 4.3 and Supplementary Table 
9). 2,032 PCR-based markers from published genetic maps” were PCR-screened 
on custom multidimensional (MD) DNA pools (http://ampliconexpress.com/) 
obtained from BAC library HVVMRXALLeA’. A single haploid genome equi- 
valent of these MD pools was used for multiplexed screening of 42,302 barley EST- 
derived unigenes represented on a custom 44K Agilent microarray as previously 
described®. 27,231 barley unigenes, comprising 1,121 with a genetic map position”, 
could be assigned to 12,313 BACs. 14,600 clones from BAC library HVVMRXALLhA 
were screened with 3,072 SNP markers on Illumina GoldenGate assays” leading to 
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1,967 markers directly assigned to BACs'*; approximately one third of this 
information has been included in the present work. 

Indirect anchoring. Sequence resources associated with the FPCmap frame- 

work provided the basis for extensive in silico integration of genetic marker 
information (Supplementary Note 4.3 and Supplementary Table 11). Repeat 
masked BES sequences, sequences of anchored markers and 6,295 sequenced 
BACs allowed integration of 307 Mb of ‘Morex’ whole-genome shotgun contigs 
into the FPC map. Genetic markers and barley gene sequences were positioned to 
this reference by strict sequence homology association. Overall 8,170 (~4.6 Gb) 
BAC contigs received sequence and/or anchoring information (Supplementary 
Note 4). 4,556 FPC contigs (X = 3.9 Gb) were anchored to the genetic framework. 
Analysis of repetitive DNA and repeat masking. Repeat detection and analysis 
was undertaken as previously described'*”® with the exception of an updated 
repeat library complemented by de novo detected repetitive elements from barley 
(Supplementary Note 5). 
Gene annotation, functional categorization and differential expression. 
Publically available barley full-length cDNAs and RNA-seq data generated in 
the project (Supplementary Note 6) were used for structural gene calling (Sup- 
plementary Note 7). Full-length cDNAs and RNA-seq data were anchored to repeat 
masked whole-genome shotgun sequence contigs using GenomeThreader® and 
CuffLinks®, respectively, the latter providing also information of alternatively 
spliced transcripts. Structural gene calls were combined and the longest ORF for 
each locus was used as representative for gene family analysis (Supplementary 
Note 7.1.2). 

Gene family clustering was undertaken using OrthoMCL (Supplementary Note 
7.1.3) by comparing against the genomes of Oryza sativa (RAP2), Sorghum bicolor, 
Brachypodium distachyon (v 1.4) and Arabidopsis thaliana (TAIR1O0 release). 

Analysis of differential gene expression (Supplementary Note 7.2) was per- 

formed on RNA-seq data using CuffDiff®. 
Analysis of sequence diversity. Genome-wide SNV was assessed by mapping 
(BWA v0.5.9-r16°°) the original sequence reads of sequenced genotypes to a de 
novo assembly of cultivar ‘Morex’. Sequence reads from RNA-seq were mapped 
against the ‘Morex’ assembly. Details are provided in Supplementary Note 8. 
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MRI presents microbial vitamin B 
metabolites to MAIT cells 


Lars Kjer-Nielsen', Onisha Patel”, Alexandra J. Corbett’, Jerome Le Nours””, Bronwyn Meehan!, Ligong Liu’, Mugdha Bhati’, 
Zhenjun Chen!, Lyudmila Kostenko', Rangsima Reantragoon!, Nicholas A. Williamson, Anthony W. Purcell”, 

Nadine L. Dudek*, Malcolm J. McConville’, Richard A. J. O’Hair®, George N. Khairallah®, Dale I. Godfrey’, David P. Fairlie’, 
Jamie Rossjohn?*’* & James McCluskey'* 


Antigen-presenting molecules, encoded by the major histocompatibility complex (MHC) and CD1 family, bind peptide- 
and lipid-based antigens, respectively, for recognition by T cells. Mucosal-associated invariant T (MAIT) cells are an 
abundant population of innate-like T cells in humans that are activated by an antigen(s) bound to the MHC class I-like 
molecule MR1. Although the identity of MR1-restricted antigen(s) is unknown, it is present in numerous bacteria and 
yeast. Here we show that the structure and chemistry within the antigen-binding cleft of MR1 is distinct from the MHC 
and CD1 families. MR1 is ideally suited to bind ligands originating from vitamin metabolites. The structure of MR1 in 
complex with 6-formyl pterin, a folic acid (vitamin B9) metabolite, shows the pterin ring sequestered within MR1. 
Furthermore, we characterize related MR1-restricted vitamin derivatives, originating from the bacterial riboflavin 
(vitamin B2) biosynthetic pathway, which specifically and potently activate MAIT cells. Accordingly, we show that 
metabolites of vitamin B represent a class of antigen that are presented by MRI for MAIT-cell immunosurveillance. As 
many vitamin biosynthetic pathways are unique to bacteria and yeast, our data suggest that MAIT cells use these 


metabolites to detect microbial infection. 


MATT cells, which comprise up to 10% of the peripheral blood T-cell 
population of humans, are readily detected in blood, mesenteric 
lymph nodes and the gastrointestinal mucosa’. Despite the abundance 
of this innate-like T-cell population, the role of MAIT cells in health 
and disease is unclear, although they can have crucial functions in 
protective immunity*’. Indeed, MAIT cells are rapidly activated by a 
wide range of microorganisms, including diverse strains of bacteria 
and yeast, suggesting that MAIT cells respond to a conserved anti- 
gen(s) common to these microbes*®. MAIT cells are activated by 
means of their «8 T-cell antigen receptor (TCR), which, analogous 
to the natural killer T (NKT)-cell antigen receptor’®, consists of an 
invariant TCR o chain paired with a limited array of VB chains 
(Vot7.2Ju33 paired with VB2 or VB13)””°. The constrained gene usage 
of the MAIT-cell antigen receptor (MAIT TCR), which is distinct 
from that of the NKT-cell antigen receptor (Vu24Ju18-VB11 in 
humans), suggests that MAIT cells target a key, albeit limited and 
atypical, class of antigen®. Providing further evidence for highly con- 
served MAIT ligand(s), mutagenesis studies of MAIT TCRs with 
different VB segments showed that a cluster of conserved TCR resi- 
dues is crucial for MAIT TCR recognition of diverse microbes’. The 
MAIT TCR is restricted to the monomorphic MHC class I (MHC-I)- 
like related molecule (MR1), which is highly conserved in mammals 
and encoded by a single gene that is not associated with the MHC’. 
MRI transcripts are ubiquitously expressed in all cell types, and the 
MRI primary structure is highly conserved across species, thereby 
suggesting a key, evolutionarily conserved function for MRI in 
immunity’*"*. Endogenous cell surface expression levels of MR1 are 


typically very low, suggesting that quantitatively limiting antigen(s) 
may be required to increase MR1 presentation’*’’. MRI shares con- 
siderable sequence similarity with the MHC-I and CD1 families— 
antigen-presenting molecules that are ideally suited to bind peptide- 
and lipid-based antigens, respectively. However, the precise identity 
of the MR1-restricted antigen(s), representing an important question 
in MAIT-cell biology, is unknown. 


Identification of an MR1-restricted ligand 


Yeast and many, but not all, bacterial species can activate MAIT cells 
in an MR1-restricted manner, suggesting that a closely related series 
of antigen(s) common to these microbes specifically bind MR1 
(ref. 3). MHC-I and MHC-I-like molecules are usually stable only 
in the presence of ligand, and so, unsurprisingly, MR1 could not 
be refolded efficiently in refolding buffer alone, consistent with 
MRI requiring a specific antigen(s) for its assembly and stability 
(not shown). Reasoning that recovery of properly assembled MRI- 
B2-microglobulin (82m) complexes would indicate efficient capture 
of a ligand(s), we established refolds of denatured human MRI and 
B2m with different sources of candidate ligands, including lipid-based 
ligands that were previously proposed to bind MRI (ref. 16). Re- 
folding in the cell culture medium RPMI-1640 (a control) enhanced 
the yield of MR1-B2m complexes, suggesting the presence of trace 
amounts of MR1 ligand(s) in this medium. RPMI-1640 is a defined 
culture medium containing several vitamin supplements, some of 
which are uniquely synthesized in bacteria and plants, but not mam- 
mals. Hence, we tested the refolding of MR1 in the presence of various 
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vitamin sources. We observed that folic acid sources (including 
RPMI-1640 medium, vitamin B complex tablets, or folic acid itself) 
all notably enhanced yields of MR1 purified by subsequent chro- 
matographic steps (Fig. 1a). The chromatographic properties of the 
refolded MR1-antigen closely matched those of refolded peptide- 
MHC-I (ref. 17), with a molecular mass of approximately 44 kDa, 
consistent with a monomeric MR1-B2m complex (Fig. 1a). Moreover, 
the refolded material reacted with an anti-MR1 monoclonal antibody 
(Fig. 1b), indicating that a vitamin-derived compound(s) enabled the 
proper refolding of MR1. 

Analysis of MR1 refolded with folic acid by negative mode electro- 
spray ionization-time-of-flight mass spectrometry (ESI-TOF-MS), 
exclusively revealed a ligand with a mass to charge (m/z) ratio of 
190.03. Comparison of this species with the Scripps Metlin metabolite 
mass spectrometry database (http://metlin.scripps.edu/), as well as 
further tandem mass spectrometry analysis, suggested that this spe- 
cies was 6-formy] pterin (6-FP) (Fig. 1c, d). ESI-TOF-MS analysis of 
MRI refolded with pure, chemically synthesized 6-FP showed an 


identical species to that found in MRI refolded with folic acid 
(Fig. 1d). The species giving m/z 190.03 by ESI-TOF analysis of 6-FP 
and MRI1-6-FP had identical liquid chromatography retention times 
and product ions after tandem mass spectrometry analysis (Fig. 1d and 
Supplementary Fig. 1). Notably, 6-FP is produced by the photodegra- 
dation of folic acid'’, and folic acid contained trace amounts of 6-FP as 
measured by ESI-TOF-MS. Moreover, chromatographically purified 
folic acid that was depleted of 6-FP, and subsequently promptly used 
for refolding with MR1, still yielded the MR1-6-FP complex (data 
not shown), indicating that photodegradation of folic acid was rapid. 
Furthermore, 6-FP specifically upregulated cell surface expression of 
MRI on human lymphoid C1R cells (Supplementary Fig. 2). These 
studies provided evidence that MRI can bind to small molecule organic 
compounds, such as 6-FP, that are derived from vitamin metabolism. 


The structure of MR1-antigen 


To gain insight into the detailed architecture of the MR1 molecule, we 
expressed and refolded MR1 in complex with 6-FP and subsequently 
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Figure 1 | Refolding of MR1 in the presence of 6-FP. a, Left, refolded MR1 
separated by 15% SDS-—polyacrylamide gel electrophoresis. Right, size- 

fractionated MR1 (bottom) and control HLA-B8 (top). mAU, milli-absorbance 
units. b, ELISA with refolded MR1 or control MAIT TCR. Data are mean and 
s.d. from triplicate samples (n = 3). c, 6-FP. d, Mass spectrometry analysis of 
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MRI1-6-FP. Left, compound at m/z 190.0347 observed with MR1-6-FP (top) 
and 6-FP (190.0328) (bottom). Right, product ions obtained from targeted 
fragmentation of the m/z 190.0347 species observed with MR1-6-FP (top) and 
of the m/z 190.0328 species of 6-FP (bottom); precursor ions indicated by black 
diamonds. Background counts were obtained with control buffer-only samples. 
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determined the structure of the MR1-6-FP complex (Supplementary 
Table 1 and Supplementary Fig. 3) and compared it with representative 
peptide- and lipid-binding antigen-presenting molecules, namely HLA- 
A2 and CD1d, respectively. 

MRI adopted a standard MHC-I fold (Fig. 2a). The MRI heavy 
chain shares close structural homology with HLA-A2 (39% sequence 
identity, root mean square deviation (r.m.s.d.) of 1.73 A over 213 Ca 
atoms) and CD1d (22% sequence identity, r.m.s.d. 2.26 A over 151 Cu 
atoms)'*”°. MRI was most closely related to an avian monomorphic 
MHC-I like molecule (43% sequence identity, r.m.s.d. 1.77 A over 276 
Ca atoms) that is thought to bind non-peptide-based antigens” 
(Supplementary Fig. 4). The «1-«2 domains formed the MRI anti- 
gen-binding cleft, which comprises two long a-helices sitting atop a 
B-sheet, akin to HLA-A2 and CD1d (Fig. 2b). The helices of the MR1 
antigen-binding cleft were not closely juxtaposed, as was observed for 
the human hemochromatosis (HFE) protein, an MHC-I-like mole- 
cule that does not bind antigen” (Fig. 2b). Indeed, the positioning of 
the «1 and «2 helices of MRI more closely resembled HLA-A2 than 
that of the more constricted CD1d antigen-binding cleft (r.m.s.d. 
1.0 A over 133 Co atoms and 3.1 A over 99 Co atoms, respectively) 
(Fig. 2b). However, the central cleft of MR1 is not suited, either chem- 
ically or structurally, to accommodate peptide- or lipid-based anti- 
gens (Fig. 3a). Namely, the HLA-A2 cleft is solvent exposed and 
mostly polar, thereby ideally suited for binding peptides”; whereas 
the CD1d cleft binds lipids by means of a hydrophobic-lined cavity 
shielded from solvent”. By comparison, the MRI antigen-binding 
cleft was mostly solvent exposed, consisting of a mixture of charged 
and hydrophobic residues, of which a preponderance of aromatic 
residues within the «1 and «2 helices was evident (Fig. 3a and 
Supplementary Fig. 5). Furthermore, the central cavity of MRI 
(760 A*) was much smaller than that of CD1d (1,690 A) (Fig. 3b). 
Although MHC-I molecules contain a conserved network of residues at 
the amino- and carboxy-terminal ends of the antigen-binding groove 
that tethers the termini of antigenic peptides”, the corresponding loca- 
tions within the MRI cleft, although showing some conservation, were 
different from MHC-I (Fig. 3c). Furthermore, the end of the MRI 
groove is not ‘open’, as observed for MHC class II molecules”. 
Whereas MHC-I comprises six pockets that accommodate the side 
chains of the peptide, these are not present in MRI (ref. 25). Instead, 
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a large number of bulky side chains occupied the entire length and 
breadth of the cleft, and it is this architecture that probably prevented 
the helical jaws of MRI packing closely together (Fig. 3c). Accordingly, 
the structure and chemical properties of the MR1 antigen-binding cleft 
were distinct from that of peptide- and lipid-based antigen-presenting 
molecules. 


Mode of MR1-antigen presentation 


6-FP was located centrally within the MRI cleft, positioned towards 
the base of the B-sheet (Figs 2a and 3d). The pterin ring lies relative- 
ly flat against the B-sheet, and in a different location to abacavir, 
a bicyclic compound recently found to bind HLA-B*57:01 (ref. 26) 
(Supplementary Fig. 6). 6-FP exhibited very limited solvent accessi- 


bility, with 317A? of the available 327 A? being buried by MRI 
(Fig. 3a). Binding to 6-FP was dominated by hydrophobic interac- 
tions, with Tyr 7, Tyr 62, Trp 69 and Trp 156 forming an ‘aromatic 
cradle’ that sequestered the ligand. In addition, the ligand formed van 
der Waals interactions with Arg 9, Arg 94, Ile 96 and Gln 153 (Fig. 3d) 
(Supplementary Table 2). Notably, a previous mutational study on 
MRI implicated Tyr 7, Arg9 and Arg 94 in MAIT-cell activation". 
There was clear evidence for a covalent bond between the Lys 43° 
and the formyl group of 6-FP (Supplementary Fig. 3), indicating that, 
during purification and/or crystallization, Lys 43 had formed a Schiff 
base with the formyl group. An engineered Lys43Arg MRI mutation 
failed to refold in the presence of 6-FP (not shown), highlighting the 
importance of the Lys 43-6-FP interaction. Adjacent to the MRI- 
antigen-binding pocket there were two positively charged residues 
(Arg 9 and Arg 94) protruding up into the cleft, suggesting a require- 
ment for polar moieties in other potential MR1-restricted ligands. Of 
note, the residues in contact with 6-FP (and the two Arg residues) are 
conserved across MRI from all species (Supplementary Fig. 7), sug- 
gesting that recognition of ligands within the MR1-binding pocket is 
highly conserved. Hence, the structure of the MR1-6-FP complex 
shows how MR1 is ideally suited to present small organic compounds 
that can originate from vitamins. 


MRI1-restricted MAIT activation 
Although 6-FP is a ligand for MRI, it did not activate Jurkat cells trans- 
duced with a MAIT TCR (termed Jurkat.MAIT cells) (Fig. 4a) or 


Figure 2 | Overview of the crystal structure of 
MRI-antigen complex. a, Structure of MR1. The 
a1, #2 and «3 domains of MRI non-covalently 
attached to Bom. «1, «2 and «3 are in cyan; Bom is 
in light orange; and 6-FP is in magenta. b, Overlay 
of the «1 and «2 helices using the residues within 
the antigen-binding cleft of MR1 with HFE (left, 
PDB code 1A6Z), MHC-I (middle, HLA-A2, PDB 
code 3GSO) and CD1d (right, PDB code 1ZT4). 
MR_1 is in cyan; HFE is in yellow; MHC-1 is in red; 
CD1d is in magenta. 
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Figure 3 | Comparison of MR1, MHC-I and MHC-I-like binding clefts. 

a, Electrostatic surface of MR1, HLA-A2 and CD1d antigen-binding clefts. 
Peptide, green; lipid, cyan. A’ and F’ are binding pockets within CD1d. b, Side 
(left) and top (right) views of MR1-6-FP and CD1d-«-galactosylceramide. 

¢, Left, peptide interaction with HLA-A2 residues; right, corresponding MR1 
residues. Six pockets (A-F) of MHC-I, coloured in orange, red, cyan, green, 


primary MAIT cells (Fig. 5c, e), suggesting pterin analogues might pro- 
vide a basic structural scaffold for another ligand, or class of ligand, 
capable of activating MAIT cells. Notably, antigen-presenting cells trans- 
fected with MR1 and infected with Salmonella typhimurium specifically 
activated Jurkat.MAIT cells’, but not Jurkat cells expressing a control 
MHC-restricted TCR (Supplementary Fig. 8a, b). Furthermore, we 
found that the supernatant from S. typhimurium was able to activate 
Jurkat.MAIT and MAIT cells (Figs 4a and 5a-c, e), and we reasoned that 
identification of the bacterial-activating ligand(s) from S. typhimurium 
supernatant might be facilitated, if MR1-B2m could be refolded in 
supernatant from Salmonella grown in M9 minimal media lacking vit- 
amin supplements and their derivatives, which might compete for MR1 
binding. Therefore we refolded MR1 and searched for ligands that com- 
plexed with MR1 only in supernatant from Salmonella grown in min- 
imal medium. Using this approach, a single compound at m/z 329.1100 
was identified by high-resolution ESI-TOF-MS. This was exclusively 
present in MRI refolded in the presence of supernatant of Salmonella 
grown in M9 minimal media (that is, lacking vitamin supplements) 
(Fig. 4b, c). Other MRI1 ligands specifically derived from the 
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Top view 
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magenta and brown, respectively; peptide is in green. The surface in b and c is 
transparent to show 6-FP binding. d, Top (left) and side (right) views of MR1- 
6-FP. MRI residues that contact 6-FP are in yellow. H-bond and van der Waals 
interactions are shown in black- and red-dashed lines, respectively. MR1 is in 
cyan; 6-FP is in magenta. 


Salmonella supernatant were not identified in the mass range from 50 
to 1,000 amu (Supplementary Fig. 9). This compound was assigned an 
unambiguous atomic composition of C},HigN4O7 on the basis of both 
its molecular ion and its component isotopic mass distribution pattern. 
A search for potential matching compounds suggested one of the 
derivatives of riboflavin (vitamin B2); reduced 6-hydroxymethyl-8-p- 
ribityllumazine (rRL-6-CH,OH) (C),H;gN407), 7-hydroxy-6-methyl- 
8-p-ribityllumazine (RL-6-Me-7-OH) (C)2Hi6N4O7) and its precursor, 
6,7-dimethyl-8-p-ribityllumazine (RL-6,7-diMe) (C)3H)6N4O,) are 
candidate MAIT-activating ligands (Fig. 4d). 

To establish formally whether rRL-6-CH,OH, RL-6-Me-7-OH and/ 
or RL-6,7-diMe represented MR1 ligands and could activate MAIT 
cells, we chemically synthesized and biochemically characterized these 
compounds. Analysis by ESI-TOF-MS of pure, chemically synthesized 
rRL-6-CH,OH exclusively revealed a species identical to that found 
in MR1 refolded with Salmonella supernatant (Fig. 4b, c). Namely, 
ESI-TOF analysis identified a species with an m/z ratio of 329.1100 
in the MR1 Salmonella supernatant that matched precisely with synthetic 
rRL-6-CH,OH (found 329.1116, calculated for C},H,7N4O7 329.1103, 
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Figure 4 | Identification of bacterially-derived 
MAIT-cell antigens. a, CD69 expression of 
Jurkat.MAIT and control Jurkat.LC13 cells with 
6-FP or Salmonella supernatant (Salm. S/N), and 
C1R.MRI cells; blocking by an anti-MRI1 
monoclonal antibody. APC, allophycocyanin; 
MFI, mean fluorescent intensity. Activation of 
control Jurkat.LC13 cells by FLERGRAYGL (FLR) 
peptide with C1R.HLA-B8 cells. Data are 

mean ands.e.m. from triplicate samples (n = 3). 
b, Extracted ion chromatograms (EIC) showing 
retention times of the m/z 329.1000 species from 
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[M-H] ). Moreover, they showed identical isotopic mass distribution 
patterns (confirming atomic composition), identical fragmentation 
patterns by tandem mass spectrometry analysis, and had identical liquid 
chromatography-mass spectrometry (LC-MS) column retention 
times (Fig. 4b-d). The structures of tandem mass spectrometry frag- 
ment ions provide important evidence for the assigned structure of 
rRL-6-CH,OH, which undergoes dehydration/tautomerism to give 
m/z 311.1052 (6-methyl-8-p-ribityllumazine) or sequential dehyd- 
ration/ribityl side-chain scission to give m/z 191.0586 (Fig. 4c, d). 
The structures of rRL-6-CH,OH, RL-6-Me-7-OH and RL-6,7-diMe 
are closely related to 6-FP, but possess an extra ribityl moiety that, 
based on the crystal structure of MR1-6-FP, may permit direct contact 
by the MAIT TCR. Notably, these compounds are derived from the 
riboflavin biosynthetic pathway present in most, but not all, bacteria 
and yeast (Supplementary Table 3). 

We tested the ability of these compounds to activate Jurkat. MAIT 
cells and human MAIT cells from peripheral blood (Fig. 5). Although 
the ribityl lumazines failed to activate the control Jurkat.LC13 cell line 
in the presence of MR1-expressing C1R cells (Fig. 5a), they specifically 


activated three Jurkat.MAIT-cell lines (transduced with TRBV6.1, 
TRBV6.4 and TRBV20 MAIT TCRs) in the presence of C1R cells 
expressing MRI (Fig. 5a), whereas riboflavin did not (not shown). 
An anti-MR1 blocking monoclonal antibody” specifically inhibited 
the riboflavin metabolite-mediated Jurkat.MAIT activation (Fig. 5b 
and Supplementary Fig. 10), and rRL-6-CH,OH upregulated MR1 
cell surface expression on C1R cells (Supplementary Fig. 2). All com- 
pounds specifically activated freshly isolated MAIT cells as defined by 
CD3* CD4” CD161* TRAV1.2~ (monoclonal antibody D5"), but 
not other D5" CD161_ or D5" CD161~ CD3°* cells (Fig. 5c-e and 
Supplementary Figs 11 and 12). Activation was assayed by CD69 
upregulation (Fig. 5c, d) and intracellular cytokine staining for inter- 
feron (IFN)-y and tumour necrosis factor (TNF) (Fig. 5e). Notably, 
rRL-6-CH,OH, although closely related to RL-6-Me-7-OH and RL- 
6,7-diMe, was a much more potent MAIT agonist as judged by 
Jurkat.MAIT-cell and MAIT-cell activation. This indicates that there 
are several riboflavin-based metabolites that exhibit a broad spect- 
rum in their ability to activate MAIT cells. The enzymatic pathway 
that generates these riboflavin precursors only seems to be found in 


29 NOVEMBER 2012 | VOL 491 | NATURE | 721 


©2012 Macmillan Publishers Limited. All rights reserved 


ARTICLE 


Figure 5 | MAIT-cell activation with MR1- 
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microbes that are capable of activating MAIT cells and is absent in 
non-activating microbes (Supplementary Table 3). Accordingly, 
direct precursors and metabolites of riboflavin biosynthesis clearly 
activate MAIT cells. 


Discussion 

T cells expressing x TCRs interact with diverse antigens presented by 
MHC-I and MHC-I-like molecules’’. Indeed, the cellular arm of the 
immune system uses an array of polymorphic and monomorphic 
antigen-presenting molecules to provide collectively broad awareness 
of signature microbial products with distinct chemistries, thereby 
enabling self from non-self discrimination and facilitating protective 
immunity. Our findings provide evidence that this antigenic arsenal is 
extended by the capacity of MRI to present small organic compounds 
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derived from biosynthetic intermediates to, or chemical metabolites 
of, endogenously synthesized vitamins. 

MATT cells are activated, in an MR1-dependent manner, by a broad 
spectrum of bacteria and yeast, but not by viruses or certain bacteria’. 
Although a folic acid (vitamin B9) derivative (6-FP) could bind MR1, 
it was non-stimulatory for MAIT cells, suggesting that this ligand 
represents a structural scaffold for binding MR1, but requires a fur- 
ther moiety to engender MAIT-cell activation. Accordingly, we 
showed that riboflavin (vitamin B2) derivatives, larger in size but 
closely related structurally to 6-FP, could bind MRI and activate 
MAIT cells. A key distinction between the microbes that are stimu- 
latory and non-stimulatory to MAIT cells is that the former synthesize 
riboflavin, whereas the latter do not. These observations suggest a 
mode by which MAIT cells might sense microbial infection or 
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overgrowth at mucosal sites in an MRI1-restricted manner. Of note, 
these vitamin-based metabolites are secreted and are diffusible, sug- 
gesting a mechanism by which MAIT cells might sense bacterial 
activity across mucosal membranes potentially regulating local 
immunity and mucosal barrier functions. Our findings highlight that 
bacterially produced vitamin-based metabolites can exhibit a broad 
spectrum of MAIT-cell activation. Indeed, the identification of both 
activating and non-activating MRI ligands suggests competition 
between these agents thus modulating MAIT-cell activity in the con- 
text of bacterial infection. Moreover, as the pterin ring occurs widely 
in nature, and represents a common scaffold of small molecule ther- 
apeutics, it will be of interest to establish whether the MAIT cell-MR1 
axis is perturbed in any pathological or drug-induced conditions; how 
diet can influence MAIT activity; and whether other microbial meta- 
bolites represent MAIT-cell ligands capable of modulating the func- 
tion of these cells. Clearly, there is a close relationship between the 
human immune system, host fitness and gut microbiota metabo- 
lites*~°. Defining MAIT-cell-activating ligands represents a fun- 
damental new advance that will be pivotal in understanding the 
physiological and pathological role of MAIT cells, a very abundant 
population of innate-like T cells associated with the gastrointestinal 
mucosa. Our findings also suggest that other microbial-specific meta- 
bolites*? may serve as molecular signatures of microbial infection that 
undergo immunosurveillance. 


METHODS SUMMARY 

Refolding MR1 with 6-FP. This was performed using a method similar to that 
established for MHC-I refolding. Synthesized 6-FP and 6,7-dimethyl-pterin were 
purchased from Schirks Laboratories, folic acid was purchased from Sigma- 
Aldrich. Further details are provided in the Methods. 

Analysis of MR1-6-FP by mass spectrometry. MR1-6-FP (4 1g) was loaded 
onto an XBridge C18 reversed phase column (Waters) in 20mM ammonium 
acetate, pH 5.4, buffer, and detected in an Agilent ESI-TOF mass spectrometer 
after elution in an acetonitrile gradient. Data was collected in negative ion mode. 
Synthetic 6-FP (0.4 tg) was analysed under the same conditions. 
Crystallization and data collection. The MR1-6-FP complex was crystallized 
and the structure determined as described in the Methods. 

Analysis of MR1-bacterial supernatant by mass spectrometry. See Methods for 
full details. 

Synthesis of MR1 antigens. Chemical synthesis of RL-6,7-diMe, RL-6-Me-7-OH 
and reduced rRL-6-CH,OH is described in Methods and Supplementary Fig. 13. 
Activation of Jurkat.MAIT and MAIT cells. The activation of Jurkat.MAIT cells 
was performed as described’. Further details are provided in the Methods. 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 


Preparation of denatured inclusion body MR1 and $2m. The method for 
refolding and purifying the MR1-B2m-ligand complex is based on a similar 
methodology used for classical MHC heavy chain-fh2m-peptide complex 
described previously”. Genes encoding soluble human MRI (residues 1-270 of 
the mature, leaderless protein, lacking the transmembrane and cytoplasmic 
domains); or human B2m (mature, leaderless protein) were expressed for 4h 
in BL21 E. coli after induction with 1 mM isopropyl B-p-1-thiogalactopyranoside. 
E. coli cells were pelleted and resuspended in a buffer containing 50 mM Tris, 25% 
(w/v) sucrose, 1 mM EDTA, 10 mM dithiothreitol (DTT), pH 8.0. Inclusion body 
protein was then extracted by lysis of bacteria in a buffer containing 50 mM Tris, 
pH8.0, 1% (w/v) Triton X-100, 1% (w/v) sodium deoxycholate, 100 mM NaCl, 
10mM DTT, 5mM MgCl, and 1 mg DNasel per litre of starting culture; and 
subsequent steps involved homogenization with a polytron homogenizer, cent- 
rifugation, and washing inclusion body protein sequentially with first a buffer 
containing 50 mM Tris, pH 8.0, 0.5% Triton X-100, 100 mM NaCl, 1 mM EDTA 
and 1 mM DTT, and second a buffer containing 50 mM Tris, pH 8.0, 1 mM EDTA 
and 1mM DTT. Inclusion body protein was then resuspended in a buffer con- 
taining 20 mM Tris, pH 8.0, 8 M urea, 0.5 mM EDTA and 1 mM DTT, and after 
centrifugation the supernatant containing solubilized, denatured inclusion body 
protein was collected and stored at —80°C. 

Refolding of MR1 and $2m with ligands. Two milligrams of 6-FP, 56 mg of 
denatured inclusion body MR1 protein, and 28 mg of 82m protein were added to 
a 400-ml refold buffer solution containing 0.1 M Tris, pH 8.5, 2 mM EDTA, 0.4M 
arginine, 0.5 mM oxidized glutathione and 5 mM reduced glutathione. Variably, 
5 M urea was either present or absent. After an overnight incubation at 4 °C, the 
refold buffer was dialysed against three changes of buffer containing 10 mM Tris, 
pH 8.0, over a period of 24h. The refolded MR1-B2m-ligand complex was then 
purified by sequential DEAE (GE Healthcare) anion exchange, $75 16/60 (GE 
Healthcare) gel filtration, and MonoQ (GE Healthcare) anion exchange chro- 
matography. Refolded and purified MRI was tested for structural integrity in an 
ELISA by reactivity with the MR1-reactive monoclonal antibody 26.5 (ref. 11), or 
the control monoclonal antibody 12H8 reactive with the TCR constant domain”. 
Mass spectrometry analysis of MR1 refolded with 6-FP and Salmonella super- 
natant. MR1 refolded with either 6-FP or Salmonella supernatant was analysed 
by ESI-TOF-MS, in negative ion mode, on an Agilent 6520 QTOF instrument. 
Samples were loaded onto a Waters Xbridge C18 reversed phase column (3.5 jim, 
2.1 X 100 mm) in a buffer containing 20 mM ammonium acetate, pH 5.4, and 
were eluted with a gradient of buffer B (containing 80% acetonitrile). Retention 
times of 6-FP (m/z 190.0347) and the m/z 329.1100 species from MR1 refolded 
with Salmonella supernatant were determined from extracted ion chromato- 
grams using the respective m/z values. Parent ions thus identified at respective 
retention times; as well as product ions (obtained from targeted fragmentation at 
collision-induced dissociation voltages of 10, 20 or 40 V) were then characterized. 
Controls: 6-FP, rRL-6-CH,OH, MRI refolded with control M9 media-only, and 
buffer-only controls, were included in relevant experiments. Note the nonlinear 
scale on the x axis in Figs 1d (right panel) and 4c (right panel). 

Crystallization, structure determination and refinement. MR1 (5-10mg 
ml ') crystallized at 294K in 0.02 M MgCl, 0.1M HEPES, pH7.5, and 22% 
polyacrylic acid 5100 sodium salt. Equal ratio of the protein to mother liquor 
resulted in plate-like crystals. The crystals were improved by seeding into 0.2 M 
NaCl, 0.1 M HEPES, pH 7.5, and 25% PEG 3350. Crystals were flash frozen before 
data collection using 35% PEG3350 as the cryoprotectant. The data was collected 
at 100 K on the 031D1 beamline at the Australian Synchrotron, Melbourne. The 
crystals of MRI diffracted to 3.2 A and belong to the space group P2,2;2), with 
two molecules within the asymmetric unit. The data was processed using Mosflm 
version 7.0.5 (ref. 33) and scaled using SCALA from the CCP4 suite**. The data 
were solved by the molecular replacement method using MOLREP in CCP4, with 
HLA-G (PDB code 1YDP (ref. 35)) without the peptide and loop region as a 
search model. The structure was refined using BUSTER 2.10 (ref. 36). Model 
building was carried out using COOT”. The overall structure was validated using 
MOLPROBITY* and Ramachandran plot showed 95% residues in the most 
favoured region with 0.1% outliers. All molecular graphics representations were 
created using PyYMOL”. Surface area calculations were done using the protein 
interfaces, surfaces and assemblies service PISA at European Bioinformatics 
Institute (http://www.ebi.ac.uk/pdbe/prot_int/pistart.html). Cavity volumes 
were calculated using Pocket Finder*®. The two MR1-6-FP complexes in the 
asymmetric unit are essentially identical (r.m.s.d. 0.5 A over 278 Cot atoms), 
and thus only one is described in the results. 

Synthesis of MR1 antigens. Compounds RL-6,7-diMe (1), RL-6-Me-7-OH (2) 
and a 2-electron reduced form of RL-6-CH,OH, namely rRL-6-CH2OH (3) were 
synthesized by modifications (Supplementary Fig. 13) to reported literature pro- 
cedures (see ref. 41 for review). Their purities were established using proton NMR 


spectra and reversed-phase HPLC traces. In brief, p-ribitylamine (6) was pro- 
duced in 55% yield from D-(—)-ribose via its oxime intermediate. Condensation 
with 4-chlorouracil (7) followed by nitrosation provided the key intermediate (9) 
in reasonable yield (27%). The diaminouracil (10) was unstable, and therefore it 
was generated in situ, immediately before use, by reduction of the nitroso group 
with sodium hydrosulphite. Condensation with the corresponding «,B-dicarbo- 
nyl reagents at the optimal pH under a nitrogen atmosphere in the dark gave the 
products RL-6,7-diMe (1) and RL-6-Me-7-OH (2) after purification by prepar- 
ative reversed-phase HPLC. The amount of «,B-dicarbonyl reagents significantly 
affected the product profiles. For dimethyl analogue 1, three equivalents of 2,3- 
butadione gave a much purer crude product than reported in the literature, in 
which 6.6 equivalents gave a significant amount of bis-adduct and made puri- 
fication difficult. Compound RL-6,7-diMe (1) was unstable, particularly in solu- 
tion. Thus prolonged reaction and work-up procedures were best avoided. By 
contrast, a large excess of sodium pyruvate (9-18 equiv.) was key for efficient 
production of the 7-hydroxy analogue. Similarly, condensation with excess 1,3- 
dihydroxyacetone dimer (a reduced form of «,f-dicarbonyl) gave directly the 
reduced derivative rRL-6-CH,OH (3) in low yield (5%), which was identified 
by NMR spectra (7H, °C, COSY, HSQC in DMSO-ds-D2O 10:1, (v/v)) with 
characteristic resonances for 6-hydroxymethyl ('H/'°C: 5 5.27, singlet/80.0) 
and reduced ring methylene at position-7 (‘H/'°C: 8 4.16 and 4.03, AB quartet, 
J = 13.7 Hz/62.6). The reduced derivative of RL-6,7-diMe (4) was prepared using 
excess sodium hydrosulphite (5 equiv.) and easily desalted and separated from the 
starting material using a cation-exchange column (Amberlite IR-120, H* form). 
However, this reduced form of RL-6,7-diMe (4) was readily oxidised by air and 
rapidly reverted to the original state, RL-6,7-diMe (1). Two diastereomers of the 
reduced form of RL-6,7-diMe (4) were partially separated using preparative 
reversed-phase HPLC to give enriched diastereomers (92:8 and 91:9), which were 
identified by NMR spectra (*H, °C, COSY, HSQC) with characteristic cross- 
coupled signals between the 7-methyl group (‘H/'°C: 8 1.16, doublet, J = 6.8 Hz/ 
14.4 and 6 1.17, doublet, J = 6.8 Hz/13.4) and the 7-methine CH (1H/*C: 6 4.35, 
quartet/57.8 and 6 4.31, quartet/55.5). 

Generation of the D5 monoclonal antibody. The D5 monoclonal antibody, 
reactive against human TCR Va7.2 (IMGT: TRAV1-2), was generated by 
immunizing a BALB/c mouse with soluble human MAIT TCR (using the 
MAIT V«7.2/Ju33 invariant chain paired with a VB13.3/TRBV6-1 B chain). 
The D5 monoclonal antibody was characterized, and shown to be reactive in 
an ELISA assay with soluble MAIT TCRs using the invariant TRAV1-2 « chain 
regardless of the pairing B chain (TRBV6-1, TRBV6-4, TRBV20); but was not 
reactive with the control TCR LC13 (using « chain TRAV26-2 and f chain 
TRBV7-8). Only SKW3 and Jurkat T-cell lines transduced with TCRs using a 
TRAV1-2 chain (including the ELS4 TCR”, which does not use the Ja33 
segment used by the invariant MAIT « chain) stained positive by indirect immu- 
nofluorescence. Thus, the D5 monoclonal antibody is specific for the Vo7.2/ 
TRAV1-2 « chain. This was confirmed by single-cell sorting of peripheral blood 
mononuclear cells (PBMCs) stained with the D5 monoclonal antibody, and 
subsequent RT-PCR using oligonucleotides specific for the Vo.7.2 gene segment, 
followed by sequencing of amplified DNA segments (data not shown). 
Blocking of MR1-mediated activation of cells expressing the MAIT TCR. The 
MRI1-reactive monoclonal antibody 26.5 (20 pg ml ‘sa gift from T. Hansen), or 
the isotype control monoclonal antibody W6/32 (reactive against human classical 
HLA class I molecules) was added to C1R antigen-presenting cells expressing 
MRI 1h before the addition of activating ribityl lumazine compounds, after 
which Jurkat T-cell lines expressing the MAIT TCR were added. After 18h T 
cells expressing the MAIT TCR were stained for upregulation of CD69 and 
analysed by flow cytometry. 

Activation of Jurkat.MAIT cells. Jurkat cells transduced with a MAIT TCR 
comprising the TRAV1-2-TRAJ33 invariant « chain, and the TRBV6-1, 
TRBV6-4 or TRBV20 f chains were tested for activation by addition of 
Salmonella supernatant to C1R antigen-presenting cells expressing MRI for 
16h. Jurkat.MAIT cells were subsequently stained with PE-conjugated anti- 
CD3, and APC-conjugated anti-CD69 antibodies before analysis by flow cyto- 
metry. Activation of Jurkat.MAIT cells was measured by an increase in surface 
CD69 expression. The ability of vitamin-based metabolites to activate Jurkat. 
MAIT cells was tested alongside Salmonella supernatant. 

Activation of MAIT cells. PBMCs from a healthy donor were mixed with CIR 
cells expressing MRI (10° each per well). Salmonella SL1344 supernatant (2 pl) 
from cultures grown in LB broth supplemented with 25 ug ml’ streptomycin 
(Sigma) for 4h at 37 °C, or compounds (RL-6,7-diMe, RL-6-Me-7-OH and 6-FP: 
76.2 1M final, rRL-6-CH,OH: 0.152 1M final) or PMA (2 ng ml plus ionomy- 
cin (1 ng ml *) added in a total volume of 220 j1l RF-10, and incubated overnight 
at 37 °C. Cells were stained with anti-CD3-PE-Cy7 (eBioscience, 20 ig ml}, 
anti-CD4—APC-Cy7 (Biolegend, 1.7 jig ml~ 1), anti-CD161-APC (Miltenyi Biotec, 
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1:50), FITC-conjugated D5 (anti-MAIT TCR, 10g ml), and anti-CD69-PE 
(BD Biosciences, 1:50), anti-IFN-y-PE (BD Pharmingen, 10 ugml') or anti- 
TNFo-PE (BD Pharmingen, 10 1g ml’), and analysed by flow cytometry using 
a Canto II cytometer and Diva software. For intracellular cytokine staining, bre- 
feldin A (10 pg ml ') was added to the assay after the first 1h and the incubation 
allowed to proceed overnight. Cells were fixed with 1% parformaldehyde after 
staining for surface markers and permeabilized with 1% saponin during cytokine 
stains. MAIT cells were defined as CD3* CD4~ CD161* D5" after gating on 
PBMCs using forward scatter and side scatter detectors. 
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Visualizing transient low-populated 


structures of RNA 


Elizabeth A. Dethoff'*, Katja Petzold'*, Jeetender Chugh'*, Anette Casiano-Negroni'+ & Hashim M. Al-Hashimi! 


The visualization of RNA conformational changes has provided fundamental insights into how regulatory RNAs carry out 
their biological functions. The RNA structural transitions that have been characterized so far involve long-lived species 
that can be captured by structure characterization techniques. Here we report the nuclear magnetic resonance 
visualization of RNA transitions towards ‘invisible’ excited states (ESs), which exist in too little abundance (2-13%) and 
for too short a duration (45-250 ps) to allow structural characterization by conventional techniques. Transitions towards 
ESs result in localized rearrangements in base-pairing that alter building block elements of RNA architecture, including 
helix—junction- helix motifs and apical loops. The ES can inhibit function by sequestering residues involved in recognition 
and signalling or promote ATP-independent strand exchange. Thus, RNAs do not adopt a single conformation, but rather 
exist in rapid equilibrium with alternative ESs, which can be stabilized by cellular cues to affect functional outcomes. 


Nuclear magnetic resonance (NMR) relaxation dispersion methods'”, 
which measure microsecond-to-millisecond conformational exchange, 
have made it possible to characterize the transient, low-populated 
excited state (ES) structures of proteins** and to establish their import- 
ance in catalysis’, folding*®, signalling’ and recognition®. These ESs exist 
in too little abundance (typically with populations <5%) and for too 
short a duration (lifetime < milliseconds) to allow structural charac- 
terization by conventional techniques. Recent advances that extend the 
timescale sensitivity of rotating frame (R,,) carbon relaxation disper- 
sion experiments have made it possible to characterize fully exchange 
processes in nucleic acids’""’, culminating in the discovery of ES 
Hoogsteen base pairs in DNA". Although evidence for RNA ESs has 
been reported for decades, their structure and role in function have 
remained elusive’**>. 

Here we report a strategy for characterizing the ES structures of RNA 
that combines R;, NMR experiments, mutagenesis and secondary struc- 
ture prediction. With this approach, we visualized ES structures for three 
distinct RNAs and obtained insights into their biological functions. 


ES structure of the HIV TAR apical loop 


We used a low spin-lock field Ri, NMR experiment’"’ to measure 
microsecond-to-millisecond conformational exchange at sugar (C1’) 
and nucleobase (C8 and C6) carbon sites in the well studied hexanu- 
cleotide apical loop of the transactivation response element (TAR)'® 
from the human immunodeficiency virus type-1 (HIV-1). The TAR 
apical loop is a flexible recognition site that allows adaptive binding to 
a variety of proteins’’. We observed conformational exchange (Fig. 1b 
and Supplementary Fig. 1) at carbon sites spread throughout the 
entire TAR apical loop (Fig. Hid The Rj, data could be collectively 


fitted to a two-state (se ES) exchange process (where GS 


indicates ground state) that is rauested towards an ES with population 
Prs ~13% and lifetime (tgs = 1/k—1) ~45 bs (Supplementary Table 1). 
A slower exchange process is observed at C1’ of G33 (G33-C1'), G33-C8 
and A35-C8 (pps <1% and ts = 1.9-2.3 ms), which can be assigned to 
a distinct higher energy ES that will not be discussed further (Sup- 
plementary Discussion and Supplementary Fig. 5). 


In the ground state (GS), apical loop residues exist in equilibrium 
between C2’-endo and C3’-endo sugar puckers, G34 forms a flexible 
cross-loop C30°G34 Watson-Crick (WC) base pair, whereas the 
bases of U31, G32 and A35 are flexible’*"’. To gain insights into the 
ES structure, we examined the sugar and base ES carbon chemical 
shifts (Ms) obtained from the two-state analysis of the R,, data, 
which are sensitive reporters of base stacking, sugar pucker and syn 
versus anti glycosidic angles*”. The downfield-shifted sugar ES C30- 
C1’, U31-Cl’ and A35-Cl’ chemical shifts strongly suggest that in the 
ES these residues adopt a pure C3’-endo sugar pucker characteristic of 
a helical conformation (Fig. la and Supplementary Table 1). The 
downfield-shifted base ES G34-C8 can unambiguously be assigned 
to a syn base”! (Supplementary Discussion) and has a chemical shift 
that is highly characteristic of a UUCG tetraloop, which features a 
trans-wobble GeU base pair (underlined) and a syn base (italics)**. 
Notably, TAR can accommodate a similar U3;G32G33G34 tetraloop. 
This places G34 in a syn position, where it can base pair with U31, 
thus explaining exchange at U31-C6. It also leads to the formation of 
C30¢A35 and U31¢G34 non-canonical closing base pairs, explaining 
the helical conformation observed for these residues in the ES. 
Transitions towards this ES require disruption of the cross-strand 
C30¢G34 base pair, explaining the measured activation free energy 
(12.6 kcal mol” ') (Supplementary Fig. 2), which is at the low end 
of the free energy range required to open RNA WC base pairs 
(13-16 kcal mol ')”’. This ES is also predicted to be the second most 
energetically favourable conformation using the secondary structure 
prediction program MC-Fold** (Supplementary Fig. 3). 

Weused a ‘mutate-and-chemical-shift-fingerprint’ (MCSF) strategy 
to test the proposed TAR ES. Here, a mutation or chemical modifica- 
tion is introduced to stabilize (or destabilize) a candidate ES, and the 
mutant’s NMR carbon chemical shift fingerprints are compared with 
those of the ES (or GS). We stabilized the proposed TAR ES using two 
point mutations, C30U (TAR(C30U)) and A35G (TAR(A35G)), that 
replace the ES C30eA35 non-canonical base pair with more stable 
WC U30¢A35 and C30°G35 base pairs, respectively (Fig. 1c). Both 
mutants adopted the proposed ES structure, as confirmed by NMR 
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Figure 1 | Excited-state structure of the HIV-1 TAR apical loop. a, GS and 
ES structures of TAR. ES chemical shifts indicating increased stacking and/or 
anti glycosidic angles and C3'-endo sugar pucker are blue; decreased stacking 
and/or syn glycosidic angles and non-C3'-endo sugar pucker are red. Sites with 
little to no fast exchange are grey. b, Example relaxation dispersion profile 
showing dependence of R, + Rx on spin-lock power (@,.™/2m) and offset 


(Supplementary Fig. 4), and relative to wild-type TAR both mutants 
featured large changes in the carbon chemical shifts, specifically at 
sites showing fast exchange in wild-type TAR, that are directed towards 
the ES chemical shifts (Fig. 1d). Inversely, we destabilized the ES by 
introducing a bulky N6-N6-dimethyl-substituent at the A35-N6 posi- 
tion (TAR(A35-DMA)) (Fig. 1c), which impairs formation of the ES 
C30eA35 base pair without affecting the bulged-out A35 GS confor- 
mation. This modification quenched the chemical exchange (Sup- 
plementary Fig. 1) and resulted in oppositely shifted chemical shift 
perturbations that are directed towards the GS (Fig. 1d). It also allowed 
observation of the A35-C2H2 resonance, which was otherwise severely 
exchange-broadened (Supplementary Fig. 4), possibly due to protona- 
tion of A35-N1 and formation of a protonated ES A35**C30 wobble 
base pair’*”*. Indeed, we were able to stabilize the ES by reducing the 
pH from 6.4 to 4.6, as verified by analysis of carbon chemical shifts and 
nuclear Overhauser effects (Fig. 1d and Supplementary Figs 1 and 4). 
Conversely, increasing the pH to 7.4 stabilized the GS and quenched the 
chemical exchange (Fig. 1d and Supplementary Fig. 1). 

What is the functional significance of the TAR ES? The ES sequesters 
U31, G34, C30 and A35 into base pairs, such that they are no longer 
available to bind the viral transactivator protein Tat and human 
cyclin T1 (Fig. le), which together activate transcription of the HIV-1 
genome. Notably, analysis of previous mutations reveals that mutants 
that stabilize the TAR ES inhibit Tat/cyclin T1 binding and transcrip- 
tional activation, whereas mutants that do not stabilize the ES have little 
to no effect”*** (Supplementary Fig. 6). The TAR ES is destabilized 
relative to the GS by only ~1.1kcalmol ' (Supplementary Fig. 2), 
and can readily become >50% populated upon binding to one of several 
proteins known to bind TAR and interact with the apical loop”, or by 
other physiochemical parameters such as the lowering of pH. The 
TAR ES may be involved in downregulating transactivation of the 
HIV genome or provide a mechanism for releasing Tat and cyclin T1. 
Although these functional roles remain to be verified, stabilizing the 
autoinhibited TAR ES immediately provides a new route for targeting 
TAR in the development of anti-HIV therapeutics. 
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(Q/21), where @ is the difference between the observed resonance frequency 
and the spin-lock carrier frequency. Shown is a global fit (solid line) to a two- 
state Laguerre equation. Error bars indicate one s.d. c, Mutant mimics of GS and 
ES. d, Comparison of carbon chemical shifts (CS) for the ES, GS and mutant 
mimics. Carbons at the site of mutation are indicated using an asterisk. 

e, Proposed functional role for TAR ES. 


ES structure of the ribosomal A-site 

We used our strategy to characterize the ES structure of the ribosomal 
A-site internal loop” (Fig. 2a). The A-site has essential roles in decoding 
messenger RNA by flipping out two internal-loop adenines (A1492 and 
A1493, referred to hereafter as A92 and A93), which interact with and 
stabilize the codon-anticodon mini-helix formed between the cognate 
aminoacyl tRNA and mRNA”*” (see Fig. 2e). We observed extensive 
carbon chemical exchange at seven residues within and below the A-site 
internal loop (Fig. 2a, b and Supplementary Fig. 1). A two-state analysis 
of the R;, data revealed a global exchange process directed towards 
an ES with population pps ~2.5% and lifetime tgs = 1/k_, ~ 248 [Ls 
(Supplementary Table 1). 

Biophysical studies show that in the GS, A92 is looped inside, 
probably forming a base pair with A08, whereas A93 is partially 
flipped out and flexible*’ (Fig. 2a). An ES involving the flipping out 
of A92 and A93, as observed in several X-ray and NMR structures of 
drug-bound A-site*’, can be ruled out based on the observation of 
exchange below the internal loop, ES chemical shift fingerprints that 
suggest increased stacking for A93 (Supplementary Fig. 5), and by 
comparison of ES chemical shifts with those of drug-bound A-site 
(Supplementary Fig. 7). 

Rather, the breadth of exchange across many different residues 
points to a larger structural rearrangement. The downfield-shifted 
base ES chemical shift for U95-C6 indicates looping out of U95, 
whereas the upfield-shifted base carbon ES chemical shifts indicate 
increased stacking for A92, A93, G94 and C96 (Fig. 2a and 
Supplementary Table 1). These data can be explained by an alternative 
structure in which U95 bulges out while A93*C07, G94eU06 and 
A08eA92 form three consecutive non-canonical base pairs (Fig. 2d 
and Supplementary Table 1). A transition towards such an ES requires 
the opening of C07¢G94, explaining the sizable free-energy barrier of 
~14.8 kcal mol ' (Supplementary Fig. 2)”*. This ES is predicted by 
MC-Fold to be the second most energetically favourable secondary 
structure (Supplementary Fig. 3) and has previously been observed in 
molecular dynamics simulations”. 
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Figure 2 | Excited-state structure of the ribosomal A-site internal loop. 
a, GS and ES structures of the A-site. Chemical shift fingerprints are colour- 
coded as in Fig. 1a. b, Example relaxation dispersion profile (as in Fig. 1b). 


We confirmed the proposed A-site ES using MCSF analysis. We 
were able to block transitions towards the ES by replacing U06eU95 
with a more stable WC G06°C95 base pair (A-site(U06G/U95C)) 
(Fig. 2c). This locked the A-site into the GS as judged by the GS-like 
chemical shifts (Fig. 2d and Supplementary Fig. 5) and absence of 
chemical exchange, including at sites (for example, A92 and A93) that 
are distant from the site of mutation (Supplementary Fig. 1). This 
also confirmed that all sites experience a common global exchange 
process. We then stabilized the proposed ES by deleting U95, which 
bulges out in the ES (A-site(AU95)), and by introducing a methyl 
group at U95-N3 (A-site(U95-N3M)), which is expected to disrupt 
the GS U06*U95 non-canonical base pair in favour of the bulged-out 
ES conformation (Fig. 2c). The A-site(AU95) mutant adopted the 
proposed ES structure as confirmed by NMR (Supplementary Fig. 4) 
and resulted in large changes in the carbon chemical shifts specifically 
at sites showing exchange that are directed towards the ES chemical 
shifts (Fig. 2d and Supplementary Fig. 5). More dramatically, the 
A-site(U95-N3M) mutant exhibited two equally populated sets of 
resonances in slow exchange on the NMR timescale (Supplementary 
Fig. 4), with one set corresponding to the GS and the other in near- 
perfect agreement with the ES (Fig. 2d and Supplementary Fig. 5). 

The A-site ES sequesters A92 and A93 into base pairs, making them 
unavailable to decode mRNA. It also affects the structural presentation 
of A-site residues involved in protein recognition and formation of the 
B2a intersubunit crossbridge (Fig. 2a, e). Thus, we analysed previous 
mutational data in light of the ES A-site structure determined here. 
Interestingly, mutants that are predicted to stabilize the A-site ES 
increase the rates of stop-codon readthrough and frame-shifting, both 
of which are processes that can bypass mRNA decoding™ or inhibit 
binding of initiation factor 1°° (Supplementary Fig. 6). In addition, the 
introduction of chemical groups at the U95-N3 position, a modifica- 
tion that is analogous to that which we used to trap the A-site ES, leads 
to severely impaired association of ribosomal subunits in vitro due to 
disruption of the B2a intersubunit crossbridge*®. This provides strong 
evidence that the A-site ES can form within the ribosome context 
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c, Mutant mimics of GS and ES. d, Comparison of carbon chemical shifts for the 
ES, GS and mutant mimics. e, Proposed functional role for A-site ES. 


where it can affect function. Although X-ray structures of the ribosome 
show the A-site in a GS-like conformation, in several cases, the electron 
density at the A-site is poor as judged by elevated B-factors, and can 
accommodate the ES conformation determined here (data not shown). 
The A-site ES invites reassessment of the A-site region in current 
ribosome structures and suggests a new route for targeting the A-site 
in the development of antibiotics. 


Two ES structures in HIV-1 stem loop 1 


Finally, we used our strategy to study the ES structure of the HIV-1 
stem loop 1 (SL1) (Fig. 3a). SL1 spontaneously forms kissing dimers, 
which isomerize during viral maturation into more stable duplex 
dimers through mechanisms that remain poorly understood*”~” (see 
Fig. 3e). This isomerization requires the melting and re-annealing of 
the SLI hairpin and is catalysed in vivo by the nucleic acid chaperone 
nucleocapsid protein, but can also occur spontaneously in vitro”*". A 
highly conserved asymmetric SL1 internal loop is essential for both 
nucleocapsid-dependent and spontaneous isomerization’, and has 
been shown to induce complex NMR chemical exchange****, 

We observed extensive conformational exchange in a monomeric 
SL1 construct (SLlm)** spanning 7 base pairs in and around 
the internal loop (Fig. 3a, b and Supplementary Fig. 1). Unlike the 
A-site, the exchange extends to residues both below and above the 
internal loop (Fig. 3a) and cannot be globally fitted to a single process 
(Supplementary Table 1). Rather, at least two distinct ESs (ES1 and 
ES2) need to be invoked that are sensed by residues above (ES1, prs: 
~9%, Tes) = W/k_, ~120 us) and below (E82, pgso ~2%, Tes2 = L/k_2 
~200 1s) the internal loop (Fig. 3a). Interestingly, MC-Fold also pre- 
dicts a complex energy landscape for SL1m with several isoenergetic 
secondary structures that feature variable degrees of upward or down- 
ward migration of the bulge (Supplementary Fig. 3). This, together 
with the ES carbon chemical shift fingerprints and MCSF analysis, 
led us to deduce structures for ES1 and ES2 that feature upward and 
downward migration of the bulge, respectively (Supplementary 
Discussion). 
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Figure 3 | Two mutually exclusive excited-state structures in HIV-1 stem- 
loop 1. a, GS and ES structures of SLlm. Chemical shift fingerprints are colour- 
coded as in Fig. la. b, Example relaxation dispersion profile (as in Fig. 1b). 

c, Mutant mimics of GS and ES. d, Comparison of carbon chemical shifts for the 


In ES1, the bulge migrates upward by 3 base pairs***°. Here, G29- 
G28 swap base-pairing partners with A27-G26, A27 swaps with A25, 
and G26-A25 are bulged out (Fig. 3a). We stabilized ES1 using two 
point mutants (SLIm(G8C) and SL1m(G8U)) that replace the ES 
G8eG29 mismatch with the more stable C8*G29 and U8*G29 base 
pairs (Fig. 3c). Both mutants adopted the ES1 structure as verified by 
NMR (Supplementary Fig. 4), and relative to wild-type SL1m resulted 
in large changes in carbon chemical shifts for residues within (G28 
and G29) and above (A25, G26 and A27) the internal loop that are 
directed towards the ES chemical shifts (Fig. 3d and Supplementary 
Fig. 5). In ES2, the bulge migrates downward by 2 base pairs. Here, 
G28-G29 swap base-pairing partners with C30-G31, which are now 
bulged out (Fig. 3a). We stabilized ES2 by replacing the ES2 G7*G28/ 
G8eA27 mismatches with C7*G28/U8*A27 WC base pairs (Fig. 3c). 
This double mutant (SL1m(G7C/G8U)) adopted the proposed ES2 
structure as verified by NMR (Supplementary Fig. 4) and resulted in 
large changes in the carbon chemical shifts for residues within (G28 
and G29) and below (C30 and G31) the internal loop that are directed 
towards the ES chemical shifts (Fig. 3d). Mutant mimics of ES1 and 
ES2 induce similar chemical shift perturbations for G28 (C8 and C1’) 
and G29 (C8), as expected given that they form base pairs in the two 
cases (Fig. 3d, Supplementary Fig. 5 and Supplementary Discussion). 
Notably, mutants that stabilize residues above the bulge in their ES 
conformation also stabilize residues below the bulge in their GS con- 
formation and vice versa (Fig. 3d and Supplementary Fig. 5). This 
supports the mutual exclusivity of ES1 and ES2 (Fig. 3a); ‘trapping’ the 
bulge in the upper (or lower) helix prevents downward (or upward) 
migration and therefore traps residues in the lower (or upper) helix in 
their GS. 

Together, the GS, ES1 and ES2 define a moving zipper in which 
bulge residues invade base pairs in the upper or lower helix. 
Remarkably, an analogous process, if carried out in an intermolecular 
manner between two SL1 monomers, naturally leads to isomerization 
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ESs, GS and mutant mimics. e, Proposed mechanism for spontaneous kissing- 
duplex isomerization. f, Native gel showing the reduction in isomerization rate 
caused by inhibiting exchanging conformations (see Supplementary Fig. 8). 


and duplex formation most probably through a previously proposed 
quadruplex-like intermediate” (Fig. 3e). Here, bulged-out G28 and 
G29 can invade base pairs in the upper or lower helix in another 
monomer to generate ES1- or ES2-like intermolecular base pairs 
(Fig. 3e). The bulged-out G26 and A25 or C30 and G31 can, in turn, 
carry out further intermolecular strand invasions, and this process 
can be repeated to generate a duplex dimer (Fig. 3e). In support of this 
important role for ES1 and ES2 in SL1 isomerization, mutations that 
trap ES1 or inhibit formation of ES2 significantly diminish the rate of 
isomerization, whereas control sequences that preserve the stability 
of the stem-loop without disrupting conformational exchange show 
little to no effect (Fig. 3f and Supplementary Fig. 8). Thus, transitions 
between the GS and ES can promote ATP-independent changes in 
RNA secondary structure without disrupting the structural integrity 
of entire hairpins, which may be required for other functions, such as 
the formation of kissing dimers in SL1. 

Compared to secondary structural transitions observed in many 
regulatory RNA switches’, transitions between the ground and 
excited states uncovered here involve much more localized changes 
in RNA structure, occur at rates that are two-to-four orders of mag- 
nitude faster, and do not require assistance from external factors. 
Thus, they can meet unique demands in biological circuits and macro- 
molecular machines. The ESs also present new drug targets and offer 
new opportunities in the engineering of RNA-based devices. Line- 
broadening indicative of ESs is routinely observed in NMR spectra 
of RNA and we therefore predict that RNA ESs exist in great abund- 
ance throughout the transcriptome. By combining NMR data with 
structure prediction tools, it should be possible to determine the 
three-dimensional structures of RNA ESs at atomic resolution. 


METHODS SUMMARY 


Detailed methods on RNA sample preparation and assignment, NMR relaxation 
dispersion data collection and analysis, and isomerization assays can be found in 
Methods. 
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METHODS 


Preparation and NMR resonance assignment of labelled and unlabelled RNA. 
RNA samples were prepared by in vitro transcription using T7 RNA polymerase 
(Takara Mirus Bio, Inc.), uniformly '°C/'°N-labelled nucleotide triphosphates 
(ISOTEC, Inc., Cambridge Isotope Labs) or unlabelled (Sigma-Aldrich) nucleo- 
tide triphosphates, and synthetic DNA templates (Integrated DNA Technologies, 
Inc.) containing the T7 promoter and sequence of interest. All RNAs were puri- 
fied by 20% (w/v) denaturing polyacrylamide gel electrophoresis, using 8 M urea 
and 1X TBE (89 mM Tris-borate, 89 mM boric acid, 2 mM EDTA). The RNA was 
electro-eluted from the gel in 20mM Tris pH 8 buffer followed by ethanol pre- 
cipitation. The RNA pellet was dissolved in water, annealed by heating to 95 °C 
for 10 min and rapid cooling on ice and exchanged into NMR buffer (15 mM 
sodium phosphate, 0.1 mM EDTA, and 25mM NaCl at pH 6.4) multiple times 
using an Amicon Ultra-4 Centrifugal Filter Unit (Millipore Corp.). Unlabelled 
RNA samples (TAR(C30U), TAR(A35-DMA), TAR(A35G), A-site(AU95), 
A-site(U95-N3M)) were purchased from Dharmacon (Thermo Fisher Scientific) 
and Integrated DNA Technologies and dissolved in NMR buffer (15 mM sodium 
phosphate, 0.1 mM EDTA, 25 mM NaCl, pH 6.4). The TAR pH studies employed 
the following NMR buffers: pH 7.4 (15 mM sodium phosphate, 0.1 mM EDTA and 
25mM NaCl) and pH 4.6 (15 mM acetate-dy, 0.1 mM EDTA, and 25 mM NaC)). 
Resonance assignments of wild-type RNA samples were obtained from prior 
studies'**>° and confirmed using standard resonance assignment experiments. 
Carbon R,, relaxation dispersion. All relaxation dispersion NMR experiments 
were performed on a Bruker Avance 600 MHz NMR spectrometer equipped with 
a 5-mm triple-resonance cryogenic probe. Experiments were performed at 25 °C, 
25/15 °C, and 15°C for TAR, A-site and SL1m, respectively, using uniformly 
®¢/"°N-labelled RNA constructs shown in Supplementary Fig. 1. For TAR, we 
used a second construct lacking the bulge (EIJ-TAR, Supplementary Fig. 1) to 
measure dispersion data for U31-Cé6 resonance, which is otherwise overlapped. 
For A-site, all data were measured at 25 °C with the exception of A92-C2 and 
A93-C8, which was measured at 15 °C to push the system into slower exchange 
and obtain more reliable data. Rotating frame carbon Rj, relaxation dispersion” 
data were measured using a 1D acquisition scheme that extends the sensitivity to 
chemical exchange into millisecond timescales relative to conventional 2D relaxa- 
tion dispersion methods’. On- and off-resonance relaxation dispersion data 
were recorded at various offset frequencies (Q) and spinlock powers (@,) (see 
Supplementary Table 2). The following relaxation delays were used. TAR: C30, 
U31, G34 and A35 Cl’ [0, 6 (2X), 14, 24 (2X) ms]; G32 C1’ [0, 12 (2X), 30, 50 
(2X) ms]; G33 Cl’ [0, 14 (2X), 34, 55 (2X) ms]; U31 C6 (measured on EII-TAR) 
[0, 7 (2X), 14, 28 (2X) ms]; G32 and A35 C8 [0, 10 (2X), 21, 34, 45, 55 (2X) ms]; 
G33 C8 [0, 8 (2X), 20, 35 (2X) ms]; G34 C8 [0, 5 (2X), 11, 20 (2X) ms]. A-site: 
G05 C8, G91 C1’ [0, 8, 16, 30, 36 (2X) ms]; A92 C1'/C2/C8, G94C [0, 8, 16, 24, 32 
(2X) ms]; A93 C1’, U95 C6 [0, 4, 8, 14, 20 (2X) ms]; A93 C8 [0, 8, 22, 34, 44 (2X) 
ms]; C96 C6 [0, 8, 18, 24 ms]. SLIm: A3 C2 [0, 8 (2X), 10 ms], G7 C8 [0, 8 (2X), 
16 ms], G8 C8 [0, 3.3 (2X), 5ms], U9 C6 [0, 7 (2X), 12 ms], A24 C8 [0, 9 (2X), 
12 ms], A24 C2 [0, 12, 16 (2X) ms], A25 C8 [0, 10, 14 (2X) ms], A25 C2 [0, 8 
(2X), 12 ms], G26 C8 [0, 1.5, 5 (2X), 17 ms], G26 C1’ [0, 12 (2X) ms], A27 C8 [0, 
4, 10, 17, 25 (2X) ms], A27 C2 [0, 4, 10, 17, 25 (2X) ms], G28 C8 [0, 11, 14 (2x) 
ms], G28 Cl’ [0,7 (2X), 9 ms], G29 C8 [0, 6, 8 (2X) ms], G29 Cl’ [0,7 (2X) ms], 
C30 C6 [0, 5 (2X), 14 ms], G31 C8 [0.3, 11 (2X), 15 ms], A32 C8, U34C6 A32 C2, 
[0, 10 (2X), 15 ms]; G33 C8 [0, 12 (2X), 15 ms]. 

Data points that meet C-C Hartmann-Hahn matching conditions were omitted 
from analysis as previously described’. Data were processed using nmrPipe® and 
the R,, values were computed by fitting the resonance intensities with mono- 
exponential decays using Mathematica 6.0 script’’ (Wolfram Research, Inc.). 
The relaxation dispersion data were fitted’ to fast exchange (equation (1), three 
independent variables), asymmetric exchange (equation (2), five independent 
variables), and the Laguerre equation (equation (3), five independent variables) 
using Origin 8.5.1 (OriginLab). 

Fast exchange (k.. >> Aw): 


kx 
Rip=Ri cos? 0+ R» sin? 0+ sin’ (ar) 
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Asymmetric exchange (pgs > Pes): 
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Laguerre equation (general): 
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R, and R; are the intrinsic longitudinal and transverse relaxation rates, respect- 
ively (assumed to be identical for GS and ES); Q = Q,,, — @;¢ is the offset of the 
spin-lock carrier frequency («;) from the averaged resonance frequency (Qobs)5 
@egris the effective spin-lock strength; 0 = arctan(;/Q); Qovs = pas@cs + PrsQess 
where pgs (Pgs) is the ground (excited) state fractional population (pgs + Pes = 1); 
Kex = ky + k_, is the exchange rate constant for a two-state equilibrium, where 
ky = prskex and k_, = paskex are the forward and reverse rate constants, respect- 
ively. Note that whereas for pps < 2%, Qops ~ Qes, this is not the case for signifi- 
cantly populated ESs, such as TAR and SL1m-ES1 (~13% and 9%, respectively). 
Model selection was carried out using an F-test (Supplementary Table 1), 
which uses chi-squared (7), applying the Levenberg-Marquardt minimization 
algorithm, to determine the feasibility of a model (for example, individual fits) 
versus a more complex model (that is, shared-parameter/3-state fits, number of 
independent variables equal number of reported parameters) expanded from the 
first model. In general, similar Aw values were obtained when fitting dispersion 
data using asymmetric (equation (2)) and Laguerre (equation (3)) equations. 
Errors were determined using standard Monte Carlo simulations” and verified 
using Boot strapping approaches for error analysis**°’ (data not shown). For 
TAR, all fast exchanging resonances were combined in a global fit except U31- 
C6. For the A-site, four resonances (G91-C1’, A92-C1’, U95-C6, C96-C6) can be 
globally fitted according to the F-test and the remaining resonances (G05-C8, 
A92-C8, G94-C8) can be included into the global fit without affecting the result- 
ing fitted parameters (values are within error when globally fitting four or seven 
resonances). A-site data measured at 15°C were fitted individually although 
similar Aw values were obtained when these data were included in global fits 
with other data measured at 25 °C. The Aw values obtained from both individual 
and global fits are shown for A92-C2 in Supplementary Fig. 5. For SL1, the G26- 
C8, A25-C8 and A25-C2 were combined in a global fit to characterize ES1, and 
G31-C8, C30-Cé and G7-C8 were combined in a global fit to characterize ES2. 
G28-C8, G28-Cl’ and G29-C8 were included in a global fit to ES1 and ES2 using a 
three-state model. A27-C8 and G8-C8 were fitted individually using single and 
three-state exchange models. Aw values obtained for dispersion profiles with 
R.x <5 Hz or that yielded ambiguous signs for Aw during the Monte Carlo error 
analysis were deemed unreliable (these include A-site: GO5-C1', U06-C1’, U06- 
C6, C07-C1' and A08-C2; SL1m: G29-C1' and G26-C1'). The sign of Aw for TAR 
U31-C6 was deduced from the pH-dependent perturbations. Data that failed the 
above criteria but that could be included in global fitting as judged using an F-test 
were included in the global fitting (TAR: C30-C1') or individually fitted assuming 
kx and prs values determined by globally fitting the dispersion data (SL1m, C30- 
C6 and G7-C8, and TAR, U31-C6). 
Thermodynamic analysis. The free energy difference between the GS and ES 
(AG**) and between the GS and transition state (AG") was computed using (with 


AGS = 0): 
: kh k_jh 
ES 1 1 
sc (— n(n) (tn) a) 


kih 
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c n( 2 


where k,/_, are rate constants, h is Planck’s constant, kg is Boltzmann’s constant, 
R is the gas constant and T is temperature. 

SL1 isomerization assay. SL1 isomerization assays were performed closely 
following the procedure described previously”. Briefly, SLI RNA samples (SL1, 
SL1(G8C), SL1(tGC), SL1(eGC)) (Fig. le and Supplementary Fig. 8) containing 
the wild-type apical loop were purchased from Integrated DNA Technologies, 
Inc. RNA samples were dissolved in water to a concentration of 5 11M, heated to 
95 °C for 3 min and placed on ice for 30 min. Subsequently, 50% (v/v) 2 dimeri- 
zation buffer (20mM sodium phosphate, pH 6.4, 100 mM NaCl, and 0.2mM 
MgCl.) was added to produce a final RNA concentration of 2.5 1M, and the 
sample incubated at 55°C or on ice for a variable amount of time. Native gels 
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were run using TBE buffer and control with TBM (TBE with no EDTA but 10 mM 
MgCl.) as previously described*’ and detected with ethidium bromide staining. 
MC-fold predictions of RNA secondary structure. All RNA secondary struc- 
tures were predicted based on sequence using the program MC-Fold* (http:// 
www.major.iric.ca/MC-Fold/) with standard input options. 
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An over-massive black hole in the compact 


lenticular galaxy NGC 1277 


Remco C. E. van den Bosch!”, Karl Gebhardt’, Kayhan Giiltekin*?, Glenn van de Ven’, Arjen van der Wel! & Jonelle L. Walsh? 


Most massive galaxies have supermassive black holes at their cen- 
tres, and the masses of the black holes are believed to correlate with 
properties of the host-galaxy bulge component". Several explana- 
tions have been proposed for the existence of these locally estab- 
lished empirical relationships, including the non-causal, statistical 
process of galaxy-galaxy merging’, direct feedback between the 
black hole and its host galaxy’, and galaxy-galaxy merging and 
the subsequent violent relaxation and dissipation*. The empirical 
scaling relations are therefore important for distinguishing 
between various theoretical models of galaxy evolution*®, and they 
furthermore form the basis for all black-hole mass measurements 
at large distances. Observations have shown that the mass of the 
black hole is typically 0.1 per cent of the mass of the stellar bulge of 
the galaxy’®*. Until now, the galaxy with the largest known fraction 
of its mass in its central black hole (11 per cent) was the small galaxy 
NGC 4486B'”. Here we report observations of the stellar kinema- 
tics of NGC 1277, which is a compact, lenticular galaxy with a mass 
of 1.2 x 101! solar masses. From the data, we determine that the 
mass of the central black hole is 1.7 X 10’° solar masses, or 59 per 
cent of its bulge mass. We also show observations of five other 
compact galaxies that have properties similar to NGC 1277 and 
therefore may also contain over-massive black holes. It is not yet 
known if these galaxies represent a tail of a distribution, or if disk- 
dominated galaxies fail to follow the usual black-hole mass scaling 
relations*’®. 

Direct measurements of black-hole mass often rely on obtaining 
spatially resolved stellar or gas kinematics within the black hole’s 
‘sphere of influence’, that is, the region over which it dominates the 
gravitational potential. We have obtained long-slit spectroscopy of 700 
nearby galaxies with the Marcario Low Resolution Spectrograph" on 
the Hobby-Eberly Telescope, Texas, to find suitable targets for direct 
measurements of black-hole mass (Supplementary Information). As 
shown in Table 1, six of these galaxies have very peculiar properties; 
they have velocity dispersions of ¢ > 350kms_ ‘and half-light radii of 
Re <3kpe. It is unusual for such small galaxies to have such large 
dispersions, which signify unusually high central mass concentrations: 
a simple virial mass estimate indicates that the central 200 pc of the 


galaxies listed in Table 1 contains more than 10 billion solar masses, 
which is 100 times more than typical galaxies of the same size. 

Black-hole masses can be measured directly by fitting self-consistent 
Schwarzschild models’ to spatially resolved spectroscopy data and 
high-resolution imaging. Archival Hubble Space Telescope (HST) 
imaging is available for one of these six dense galaxies, NGC 1277. 
On the basis of the HST imaging (Fig. 1) and the stellar kinematics 
(Fig. 2), we constructed 600,000 orbit-based models using iterative 
refinement to search parameter space'*”*. The best-fit model is then 
found by marginalizing over all parameters: the stellar mass-to-light 
ratio, the black-hole mass and the mass and concentration of the 
Navarro-Frenk-White dark halo’*. The confidence intervals are deter- 
mined with the goodness-of-fit statistic 77. We measure a black-hole 
mass of (17 +3) X 10° solar masses (Mo) and a total stellar mass 
of (1.2 + 0.4) x 10''Mo, with 1-s.d. confidence intervals based on 
Ay’ =1 after marginalizing over the dark-halo parameters (Sup- 
plementary Information). The black hole in NGC 1277 is one of the 
most massive black holes to be dynamically confirmed, and moreover 
has a mass fraction of 14% of the total stellar mass in the galaxy. 

No galaxy with such a large ratio of black-hole mass to stellar mass 
has previously been seen. Owing to the strong disk-like rotation 
(Fig. 2) and the lack of an unambiguous bulge in NGC 1277 (Fig. 1), 
it is unclear where to evaluate its black hole against the relation 
between black-hole mass and bulge luminosity. The central pseudo- 
bulge contains 24% of the light (Fig. 1) and the black-hole/bulge mass 
fraction is 59%. As shown in Fig. 3, NGC 1277 is a significant outlier 
from the mass-luminosity relation, by two orders of magnitude. At a 
fixed bulge luminosity of 3 X 10'°Lx,.g to 10 X 10'°LK,@, where Lk. 
is the K-band solar luminosity, dynamical measurements of black-hole 
mass now range over four orders of magnitude, from 10°Mo to 
10'°Mo, showing that bulge (or pseudobulge) luminosity is not a good 
predictor of black-hole mass. 

We now place NGC 1277 on the relation between black-hole mass 
(M,) and velocity dispersion. The average velocity dispersion inside 
the half-light radius (2.8’") and outside the sphere of influence (1.6’’) 
for NGC 1277 is ¢ = 333kms 1, according to a reconstruction of the 
best-fit orbit-based model. For this value of o, the most recent inferred 


Table 1 | Global properties of the six compact, high-dispersion galaxies 


Object Distance (Mpc) ac (kms!) Re (kpc) log(Lk,o) @(1-b/a) 
ARK90 131 392+4 1.6 11.2 0.7 
GC 1270 69 393 +3 18 11.2 0.8 
GC1277 73 403 +4 1.6 11.1 0.5 

UGC 1859 82 362+4 2.0 11.2 0.6 
UGC 2698 89 3973 27 11.4 0.7 
RK 1216 94 354+4 19 11.2 0.6 


The six galaxies presented here were observed with the Marcario Low Resolution Spectrograph’! as part of a large survey programme. We targeted galaxies from the Two Micron All Sky Survey (2MASS) extended- 
source catalogue** that are expected to have the largest spheres of influence. Our predictions of sphere of influence assume that the galaxies follow the relation between black-hole mass and host-galaxy velocity 
dispersion’. For those 2MASS galaxies without a known velocity dispersion value, we used an estimate based on the fundamental-plane relation between galaxy size, surface brightness and velocity dispersion?®. 
See the Supplementary Information for more information on the survey. The columns show the following near-infrared properties: distance from Hubble flow (column 2); stellar velocity dispersion extracted froma 
central aperture (column 3); and 2MASS* half-light radius (K band; column 4), total luminosity (K band; column 5) and apparent ellipticity (column 6), where a and b are respectively the lengths of the apparent 


major and minor axes. 


1Max-Planck Institut fiir Astronomie, Kénigstuhl 17, D-69117 Heidelberg, Germany. @Department of Astronomy, The University of Texas at Austin, 1 University Station C1400, Austin, Texas 78712, USA. 


3Department of Astronomy, University of Michigan, Ann Arbor, Michigan 48109, USA. 
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Figure 1 | Optical HST image of the compact lenticular galaxy NGC 1277. 
The image is scaled with the logarithm of the luminosity and is 19 kpc X 8 kpc, 
with north pointing up and east to the left. In this high-resolution optical 
image, the galaxy has a half-light radius of 1 kpc, is strongly flattened and is 
disky. It is clear that a superposition of multiple galaxies does not explain the 
high velocity dispersion. NGC 1277 has a small, regular, nuclear dust disk with 
an apparent axis ratio of only 0.3, which indicates that we see the galaxy close to 
edge-on. Through a multicomponent fit”® to the HST image, we identify the 
inner component, with a half-light radius of 0.3 kpc and a Seérsic index of n ~ 1, 
as a pseudobulge that contains 24% of the light. For the dynamical modelling, 
we construct a three-dimensional luminous-mass model of the stars by 


relation’® predicts a black-hole mass of 2.4 X 10°M, ©> So the measured 
value is almost one order of magnitude higher, or a 2.1-s.d. outlier 
relative to the intrinsic scatter in the M.-c relation’. 
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Figure 2 | Line-of-sight stellar kinematics of NGC 1277. The stellar 
kinematics as observed with the Marcario Low Resolution Spectrograph”, shown 
with 1-s.d. error bars, and measured” at 31 locations along the major axis of 
NGC 1277 (Supplementary Information): mean velocity (a); velocity dispersion 
(b); and higher-order Gauss—Hermite velocity moments” h; (c) and hy 

(d), respectively representing skewness and kurtosis. The kinematics show 
remarkably strong rotation and a dispersion profile that strongly peaks towards the 
centre. The best-fit Schwarzschild model (black line) corresponds toa 17 X 10°Mo 
black hole. The relation between black-hole mass and host luminosity predicts a 
10°Mog black hole, but the corresponding model (red dot-dash line) does not fit the 
data at all. The telescope resolution (seeing full-width at half-maximum, 1.6’’) is 
indicated in b and is sufficient to resolve the sphere of influence of the black hole. 
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de-projecting the two-dimensional light model from the HST image. Then the 
gravitational potential is inferred from the combined luminous, black-hole and 
dark-matter halo mass distribution. In this potential, representative orbits are 
integrated numerically, keeping track of the path and velocity along each orbit. 
We then create a reconstruction of the galaxy by assigning each orbit an 
amount of light, such that the model recreates the total light distribution, while 
simultaneously fitting the long-slit stellar kinematics observed with the Hobby- 
Eberly Telescope (Fig. 2). The models include the effect of the Earth’s 
atmosphere and the telescope optics without any a-priori assumption on the 
orbital configuration (Supplementary Information). 


Apart from NGC 1277, NGC 4486B? and Henize 2-10” are known 
to lie significantly above the relations, and at least three galaxies are 
known to lie significantly below the relations'®’*"”. We do not yet 
know if these over-massive and under-massive black holes just lie in 
the tails of a relatively narrow distribution of joint black-hole/galaxy 
properties, or if they demonstrate non-universality. Only through 
more black-hole measurements, including those in the other five com- 
pact galaxies with high velocity dispersions, we will be able to establish 
the cause of the black-hole/galaxy connection. 
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Figure 3 | The correlation between black-hole mass and near-infrared bulge 
luminosity, Lx, ©. The black line shows the mass-luminosity relation® for 
galaxies with a directly measured black-hole mass. NGC 1277 is a significant 
positive outlier. In addition to the galaxies (black dots) to which the relation has 
been fitted*, eight black-hole masses (NGC 4486B’, triangles’®, squares’) have 
been added with 2MASS K-band bulge luminosities. The error bars denote 
1-s.d. uncertainties, except for the NGC 1277 bulge luminosity, where we use its 
total luminosity as a conservative upper limit. 
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A stellar population analysis of NGC 1277” indicates that it contains 
only old (28 Gyr) stars and that there has not been any recent star 
formation. The black hole must thus have been in place for at least 
8 Gyr, because black-hole accretion without much star formation or 
the formation of a (classical) bulge is highly unlikely. Furthermore, 
there is no strong evidence that NGC 1277 has been tidally stripped, 
because its isophotes are extremely regular and disky, it seems to have a 
normal dark-matter halo as inferred from the dynamical model, and at 
large radii the rotation curve is flat out to five times the half-light radius. 

Although the six compact galaxies presented in Table 1 are unusual 
in the present-day universe, they are quantitatively similar to the typical 
red, passive galaxies at much earlier times (at redshifts of z ~ 2): those 
are also found, on average, to be smaller than similarly massive galaxies 
in the present-day universe’', possibly possess high velocity disper- 
sions” and generally have a disk-like structure”’. Perhaps the compact 
systems we found are local analogues of these high-redshift galaxies. 
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Active upper-atmosphere chemistry and dynamics 
from polar circulation reversal on Titan 


Nicholas A. Teanby', Patrick G. J. Irwin”, Conor A. Nixon*, Remco de Kok‘, Sandrine Vinatier®, Athena Coustenis”, 


Elliot Sefton-Nash!°, Simon B. Calcutt? & F. Michael Flasar? 


Saturn’s moon Titan has a nitrogen atmosphere comparable to 
Earth’s, with a surface pressure of 1.4 bar. Numerical models repro- 
duce the tropospheric conditions very well but have trouble explain- 
ing the observed middle-atmosphere temperatures, composition 
and winds’. The top of the middle-atmosphere circulation has been 
thought to lie at an altitude of 450 to 500 kilometres, where there is a 
layer of haze that appears to be separated from the main haze deck’. 
This ‘detached’ haze was previously explained as being due to the co- 
location of peak haze production and the limit of dynamical trans- 
port by the circulation’s upper branch‘. Here we report a build-up of 
trace gases over the south pole approximately two years after obser- 
ving the 2009 post-equinox circulation reversal, from which we con- 
clude that middle-atmosphere circulation must extend to an altitude 
of at least 600 kilometres. The primary drivers of this circulation are 
summer-hemisphere heating of haze by absorption of solar radia- 
tion and winter-hemisphere cooling due to infrared emission by 
haze and trace gases’; our results therefore imply that these effects 
are important well into the thermosphere (altitudes higher than 
500 kilometres). This requires both active upper-atmosphere chemi- 
stry, consistent with the detection of high-complexity molecules and 
ions at altitudes greater than 950 kilometres’, and an alternative 
explanation for the detached haze, such as a transition in haze par- 
ticle growth from monomers to fractal structures®. 

Saturn’s 26.7° obliquity means that Titan’s atmosphere experiences 
large solar flux variations during its 29.5-yr orbit around the Sun. For 
most of Titan’s year, middle-atmosphere (stratosphere and mesosphere 
at altitudes between 100 and 500 km) circulation is predicted to com- 
prise a single pole-to-pole circulation cell, with summer-hemisphere 
upwelling, winter-hemisphere subsidence and a winter-hemisphere cir- 
cumpolar vortex'**’-"*. This was confirmed during northern winter 
using measurements of temperature and trace-gas abundance made by 
NASA’s Cassini spacecraft’*"*, Titan experienced northern spring equi- 
nox on 11 August 2009, around which time changes in solar flux distri- 
bution were predicted to cause a reversal of the middle-atmospheric 
circulation. Such dynamical changes can be probed using profiles of 
temperature and trace-gas abundance’ derived from infrared spectra 
measured with Cassini’s Composite Infrared Spectrometer’? (CIRS). 
Therefore, to investigate the reversal mechanism we analysed all avai- 
lable south polar limb (horizontal viewing) CIRS observations made in 
the 4-yr period centred on the equinox. This included measurements at 
complementary high (0.5 cm ') and low (14cm) spectral resolutions 
(Supplementary Information, Supplementary Fig. 1 and Supplementary 
Table 1). 

The CIRS observations show that a very large increase in high- 
altitude trace-gas emission occurred over the south pole sometime 
between late 2010 and mid 2011 (Fig. 1). From both high- and low- 
resolution observation sequences, we derived altitude profiles of tem- 
perature, HCN, HC3N and C,H) using a nonlinear optimal estimation 


inversion method” that closely follows our previous studies'*”* (Sup- 
plementary Information and Supplementary Table 2). Additionally, the 
high-resolution data allowed determination of C3Hy, C,H, and CeH, 
profiles whose emission peaks were too close together to be resolved in 
the low-resolution data. 

Inversion results show very rapid changes in atmospheric tempe- 
rature and composition, especially after the equinox (Figs 2-4 and 
Supplementary Fig. 2). The observed south polar warming at altitudes 
above 300 km suggests that subsidence over the south pole is initiated 
just after equinox, with the increased temperature being due to adia- 
batic heating as upper atmosphere air is advected to higher pressures 
and compressed. This is similar to the process that caused a subsidence- 
induced north polar hotspot during northern winter'***. The observed 
temperature structure implies subsidence velocities of 0.5-2.0mms_', 
broadly consistent with predictions from numerical models (Supplemen- 
tary Information and Supplementary Table 3). Subsidence is weakest 
just after equinox in early 2010, at 0.5mms° ', but quickly increases to 
15mms ’ by June 2010 (2010.43). Cooling observed in the stratosphere 
(150-300 km) suggests that for the period covered by our data subsidence 
does not extend to lower altitudes. The cooling by 20 K that is evident 
between January 2010 (2010.04) and September 2011 (2011.70) at alti- 
tudes of 150-300 km (Fig. 4) is most likely due to radiative cooling from 
the lower atmosphere, which since equinox has been experiencing 
reduced insolation, and is consistent with the ~1-yr cooling timescale 
at these altitudes”. 

Changes in upper-atmosphere composition occur on similarly short 
timescales (Figs 3 and 4 and Supplementary Fig. 2), with evidence of 
large increases in trace-gas abundances occurring in 2011. This can be 
explained by a combination of subsidence and photochemically 
induced vertical gradients. High-altitude (>500km) photochemical 
reactions produce trace compounds such as HCN and HCG3N, which 
are transported into the lower atmosphere by vertical mixing pro- 
cesses, where they are destroyed by photolysis or removed by con- 
densation near the tropopause™*. The result is increasing relative 
abundances of these compounds with altitude and a vertical gradient 
inversely proportional to species lifetime’. Subsidence would advect 
these profiles downwards, causing enrichment at lower atmospheric 
levels'* and explaining the observed increase. 

Therefore, both observed temperature and abundance increases are 
consistent with mesospheric (>300 km) south polar subsidence dur- 
ing the post-equinox period. This implies a recent reversal in circu- 
lation direction for the south polar mesosphere, relative to the 
circulation direction derived earlier in the mission!*"’*. An alternative, 
purely radiative, explanation for the temperature changes can be 
rejected. Radiative time constants at mesospheric altitudes are short 
relative to Titan’s seasons, and temperature should thus in theory be 
able to react rapidly to changes in seasonal solar flux. However, photo- 
chemical lifetimes of most trace gases are comparable to or greater 
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Figure 1 | Rapid south polar atmospheric change observed using infrared 
spectra. These observations were made using Cassini CIRS after the 11 August 
2009 equinox and are grouped as follows: before 1 January 2011 (a-c); after 1 
January 2011 (d-f). The spectra indicate that between late 2010 and early 2011 
there was a large increase in trace-gas emission at the south pole. This is 
observed in three independent observation sequences at both high and low 
spectral resolution. Altitude refers to the tangent altitude, which is the closest 
distance to Titan’s surface reached by the line-of-sight vector, and is 
approximately 450 km for these spectra. Grey areas indicate measurement error 


than seasonal timescales. Therefore, to be consistent with all our data, 
the observed changes must be due to a reversal of the circulation as 
opposed to changes in direct solar heating. 

We note that, whereas the January 2010 (2010.04) temperature 
results show polar warming at high altitudes almost immediately after 
equinox, there is no evidence for large increases in trace-gas abun- 
dances until much later. In fact, the first evidence for increases in south 
polar trace gas is in June 2011 (2011.47) (Fig. 3), and this is corrobo- 
rated by subsequent observations in September 2011 (2011.70) (Fig. 4 
and Supplementary Fig. 2). However, unlike increases in south polar 
temperature caused by adiabatic heating, trace gases take time to advect 
from upper-atmosphere source regions to observable altitudes, which 
means that the temporal offset between temperature and composition 
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Figure 2 | South polar seasonal temperature changes. Cross-sections were 
derived from low-spectral-resolution limb-mapping sequences and cover pre- 
equinox (a) and post-equinox (b-d) periods. Substantial stratospheric 

(<300 km) cooling occurs after the equinox, consistent with reduced total solar 
flux during this time’’”’, as Titan moves towards southern winter. After the 
equinox (after mid-2009), there is evidence for high-altitude (450 km) polar 
warming relative to more equatorial latitudes. This is initially present as a small 
(2 K) temperature anomaly almost immediately proceeding the equinox 

(b), which increases to 6 K (c) and then to 8 K (d) in subsequent sequences. This 
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envelopes (s.e.). We focused on the 610-740 cm! (16.4-13.5 jum) spectral 
region, which contains strong trace-gas emission features. Since the 2009 
northern spring equinox, Cassini remained in an equatorial orbit around 
Saturn, which was ideal for limb sounding (horizontal viewing), and many limb 
measurements of the south polar region were taken (Supplementary Table 1). 
Most observations were of a single latitude, but several limb-mapping 
sequences were also measured, covering multiple latitudes at a time and 
allowing the determination of latitude—altitude cross-sections through the 
atmosphere (Figs 2 and 3). 


results is not inconsistent. Our results suggest that this advection pro- 
cess takes approximately 1.5-2 yr after reversal initiation. This cor- 
responds to ~100km of polar subsidence, assuming the 1.5mms 
subsidence rate inferred from polar temperature anomalies. 

An independent check on this interpretation and on south polar 
subsidence rates can be obtained from the composition results them- 
selves. Polar abundance increases at 450 km are at least an order of 
magnitude for all gases (Fig. 4 and Supplementary Table 4) except 
C,H, which has a more muted response in keeping with its longer 
atmospheric lifetime and shallower vertical gradient. A first-order 
approximation, combining results from all gases and assuming no 
photochemical alteration of gas profiles, implies average subsidence 
velocities between January 2010 (2010.04) and September 2011 
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implies that the mesospheric circulation has reversed and is now subsiding at 
the south pole. The strongest polar warming occurs in the most recent 
observation, indicating the fastest subsidence speeds. Grey regions indicate 
latitudes and altitudes where observations exist but have insufficient signal-to- 
noise ratios for an accurate temperature determination. Contour spacing is 2 K, 
which is the maximum uncertainty for this altitude range. These changes are 
confirmed by additional single-latitude observations at both high (Fig. 4) and 
low (Supplementary Fig. 2) spectral resolution. 
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Figure 3 | Seasonal changes in south polar trace-gas abundances. Cross- 
sections were derived from low-spectral-resolution limb-mapping sequences 
and show results for June 2010 (2010.43) (a—-c) and June 2011 (2011.47) (d-f). In 
the 2011 observation, trace-gas abundances have increased substantially at high 
altitudes (>450 km) over the south pole. The most pronounced increases occur 
for HCN and HCN. This is consistent with temperature determinations in 
Fig. 2 and implies a reversal in mesospheric circulation, with subsidence now 


(2011.70) of 0.8-2.3mms°' (Supplementary Information and Sup- 
plementary Table 4). This is in excellent agreement with values of 
0.5-2.0 mms | derived from the temperature results. 

The mechanism for reversal of middle-atmosphere circulation is 
related to solar flux distribution and angular momentum transfer. 
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Figure 4 | Profiles of temperature and composition taken close to the 
equinox. All profiles were derived from high-spectral-resolution (0.5 cm‘), 
single-latitude limb integrations. a, Temperature profiles show evidence for 
mesospheric polar warming, which increases and moves to lower altitude with 
time. b-g, Large increases in all trace-gas species are visible in September 2011 
(2011.70). Thin dashed lines, thin solid lines, and light blue shading indicate the 
error envelope (s.e.) for August 2009 (2009.65), January 2010 (2010.04) and 
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occurring at the south pole. Grey regions indicate latitudes and altitudes where 
observations exist but have insufficient signal-to-noise ratios for an accurate 
abundance determination. We note that HC3N cannot be reliably determined 
in June 2010 (2010.43) owing to its very low relative abundance (< 10°). Low- 
resolution mapping sequences taken before June 2010 (2010.43) show 
comparable compositions to those in a-c. VMR is the volume mixing ratio and 
quantifies the relative atmospheric abundance of each species. 


Stratospheric temperatures are not symmetric at northern spring equi- 
nox, but are slightly warmer in the south’’**”*. Therefore, given that 
temperatures and zonal winds are coupled by the thermal wind equa- 
tion, during springtime the atmosphere has to transport angular 
momentum from the pole leaving winter, where the circumpolar 
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September 2011 (2011.70) observations, respectively. The short-dashed line in 
a indicates the a-priori profile used to start the inversion. HC3N (b) and CsH, 
(g) are not reliably detected in August 2009 (2009.65) or January 2010 
(2010.04), so 1o upper limits are given instead (arrows). These high-spectral- 
resolution observations confirm the trends seen in the low-spectral-resolution 
mapping sequences (Figs 2 and 3). 
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winds are strong, to the pole moving towards winter, where the winds 
are weak”*. This results in a cross-equatorial circulation from north to 
south at high altitudes, driving subsidence in the south polar atmo- 
sphere and explaining the observed adiabatic heating and increased 
trace-gas abundances. Our observations of temperature now constrain 
the mesospheric reversal timing to shortly after equinox—perhaps 
coincident with it, but certainly within six months (0.015 Titan years). 
Furthermore, at present the observed spatial distribution of abundance 
and temperature increases limits the main zone of subsidence to 90- 
70° S, with subsidence velocities of 0.5-2.3 mms '. 

The observed reversal timing agrees very well with recent atmo- 
spheric general circulation models'’*"’. Unfortunately, direct compa- 
rison of observed temperature and composition with results of these 
models is not possible at the moment because, as noted previously’”, the 
models havea relatively low upper boundary, which means that whereas 
our peak seasonal signal occurs above 400 km, the models are only 
applicable at lower altitudes. Decoupling of tropospheric and stra- 
tospheric circulation means that this has limited effect on the lowermost 
atmosphere, and there is generally good agreement between models and 
observations for the non-superrotating troposphere and the surface’. 
However, the low model top distorts the middle-atmosphere structure 
and places the polar zonal jets at too low an altitude compared with 
observational constraints'***. Our results show that the reversal in 
atmospheric circulation takes place throughout the mesosphere, with 
temperatures implying subsidence in the 300-500-km altitude range 
and composition implying subsidence up to an altitude of at least 
600 km, into photochemical source regions, which is necessary to pro- 
vide the high trace-gas abundance we observe. Therefore, although the 
agreement between numerical models and observations is encouraging 
in terms of reversal timing and approximate reproduction of key atmo- 
spheric features such as superrotation and zonal jets, our results show 
that it is critical for the next generation of models to be extended to 
higher altitudes to fully capture Titan’s dynamical behaviour. 

If the circulation does extend to 600 km altitude or more then the 
detached haze layer observed around 450-500 km cannot mark the top 
of middle-atmosphere circulation, as has been previously assumed. 
Recent seasonal changes in the altitude of the detached haze” provide 
a strong argument for circulation-induced modification of the haze, 
but our observations show that an origin for the detached haze in terms 
of dynamical transport and a coincident peak haze-production alti- 
tude* cannot provide a complete explanation. Instead, an origin in a 
transition from monomer to fractal haze particles’, combined with 
higher-altitude haze production and subsequent modification by 
dynamical circulation, is required. Such high-altitude circulation also 
needs a driving mechanism, implying that solar heating of haze and 
cooling due to infrared emission from trace gases is important at 
higher altitudes than previously thought. 

Therefore, a consistent picture of Titan’s middle and upper atmo- 
spheres is now emerging. Complex chemistry occurs in the uppermost 
atmosphere, as evidenced by heavy ions and molecules detected by 
Cassini’s in situ instruments*’** and ultraviolet observations of haze 
opacity in the thermosphere”, in broad agreement with active thermo- 
spheric photochemistry predicted by the most recent one-dimensional 
photochemical models***°. Our measurements show that the radiative 
effects of this complex chemistry are sufficient to drive dynamics up to 
very high altitudes, effectively linking chemical and dynamical pro- 
cesses well into the thermosphere (>500 km). 
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Observing the drop of resistance in the flow of 


a superfluid Fermi gas 


David Stadler’, Sebastian Krinner', Jakob Meineke’, Jean-Philippe Brantut! & Tilman Esslinger! 


The ability of particles to flow with very low resistance is charac- 
teristic of superfluid and superconducting states, leading to their 
discovery in the past century’”. Although measuring the particle 
flow in liquid helium or superconducting materials is essential to 
identify superfluidity or superconductivity, no analogous measure- 
ment has been performed for superfluids based on ultracold Fermi 
gases. Here we report direct measurements of the conduction pro- 
perties of strongly interacting fermions, observing the well-known 
drop in resistance that is associated with the onset of superfluidity. 
By varying the depth of the trapping potential in a narrow channel 
connecting two atomic reservoirs, we observed variations of the 
atomic current over several orders of magnitude. We related the 
intrinsic conduction properties to the thermodynamic functions in 
a model-independent way, by making use of high-resolution in situ 
imaging in combination with current measurements. Our results 
show that, as in solid-state systems, current and resistance measure- 
ments in quantum gases provide a sensitive probe with which to 
explore many-body physics. Our method is closely analogous to the 
operation ofa solid-state field-effect transistor and could be applied 
as a probe for optical lattices and disordered systems, paving the way 
for modelling complex superconducting devices. 

Over the past decade, cold atoms have emerged as a many-body 
system with a uniquely high level of control’. Experiments have shown 
that interacting atomic Fermi gases, analogous to electrons in a solid, 
can display superfluidity*. The equilibrium properties of such gases 
have been measured with increasing precision** and the superfluid 
character of the ground state has been investigated via the response to 
external perturbations’ and the direct observation of vortices’®, in the 
same way as for Bose-Einstein condensates'"*. Using new techniques 
to create and observe directed currents in a closed atomic circuit'® or 
between two large reservoirs’’, it is now possible to study the transport 
properties of mesoscopic systems that are directly analogous to elec- 
tronic devices"*. 

Here we investigated the conduction properties of strongly inter- 
acting fermions flowing through a quasi-two-dimensional, multimode 
channel, which connects two atomic reservoirs. Going beyond our 
previous work’’, we now obtained full control over the atomic current 
by tuning a repulsive gate potential in the channel. The gate potential 
was created by an off-resonant laser beam, as illustrated in Fig. 1. In 
analogy with an electronic field-effect transistor, this gate potential 
controls the chemical potential in the channel while keeping the tem- 
perature imposed by the reservoirs unchanged. With the gate potential 
as a control parameter, we measured the current through the channel 
over a large dynamic range and related it to the observed density in the 
channel region. This allowed us to observe the onset of superfluid flow 
of strongly interacting fermions. We compared these measurements to 
the case of weakly interacting fermions. In our experiment, the current 
established in the channel was a response to the longitudinal per- 
turbation induced by a difference in chemical potential between the 
two reservoirs. This is complementary to experiments probing the 
response of isolated atomic clouds to transverse excitation via rota- 
tion’® or shear”. 


Our experiments were performed with strongly and weakly inter- 
acting quantum degenerate gases of fermionic °Li atoms, equally popu- 
lating the lowest two hyperfine states. To obtain a strongly interacting 
gas, the atoms were placed in a homogeneous magnetic field of 834 G 
where the s-wave scattering length diverges and leads to the formation 
of pairs, while a weakly interacting gas was studied at a field of 475 G 
(see Methods and Supplementary Information). The atoms were 
radially confined in the x-z plane by an optical dipole trap oriented 
along the y-axis with a 1/e” beam radius of 22(1) jum; here and else- 
where, the value in parentheses is the 1-o error of the last significant 
digit. Along the y-direction, the curvature of the magnetic field yielded 
a harmonic confinement with a frequency of m, = 2m X 32(1) Hz. To 
engineer the reservoirs, we split the cloud into two parts using a repul- 
sive laser beam at a wavelength of 532 nm that points along the x- 
direction (beam not shown in Fig. 1). The intensity profile of this beam 
has a holographically imprinted nodal line along the y-axis. As a result, 
achannel in the x-y plane was formed, which confined the atoms along 
the z-direction with a centre trap frequency of 2.9 kHz. The gate poten- 
tial was created by another laser beam at 532 nm that was sent along 
the z-axis onto the channel and had a waist of 18 um. We refer to the 
maximum of the repulsive potential created by this beam as the gate 
potential U. Along the z-axis, a high-resolution microscope objective 
was used for in situ absorption imaging of the atoms in the channel. 
The atom number in the reservoirs was measured by absorption 
imaging along the x-direction. By creating an atom number imbalance 
between the two reservoirs, we created a chemical potential bias that 
induced a current through the channel’’. 

The inset to Fig. 2a presents an example of the time evolution of the 
relative number imbalance between the two reservoirs, measured for 
strongly interacting (red) and weakly interacting (blue) fermions, 
using the same gate potential of U=525(50)nK. For the strongly 
interacting gas, an exponential fit yielded a decay time of 0.057(7) s, 


Gate beam 


Drain 


Figure 1 | Principle of the experiment. Two atomic reservoirs (source and 
drain) are connected by a quasi-two-dimensional conducting channel. Anatom 
number imbalance Nieg > Nrignt between source and drain drives an atom 
current through the channel, indicated by the arrows. A repulsive laser 

beam (gate beam) propagating along the z-axis is focused on the channel. It 
creates a repulsive potential with a gaussian envelope and a tunable amplitude. 
The lighter region in the channel indicates the reduced density due to the 
repulsive potential. 
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which is more than one order of magnitude faster than the decay time 
of 0.70(6) s obtained for the weakly interacting gas. 

The reservoirs can be considered to be in quasi-thermal equilibrium 
during the entire decay, provided this process is sufficiently slow com- 
pared to the thermalization dynamics within the reservoirs. Thus we 
interpret the exponential decay of the imbalance as a resistance mea- 
surement through a tunable channel with resistance R. This is ana- 
logous to the discharge of a capacitor with a fixed capacity C where 
the decay time is t = RC. In our system C is the compressibility of the 
reservoirs, which remains constant as the gate potential is varied”. The 
natural timescale to which we compare the decay time is provided by 
@,, the frequency of the overall harmonic confinement along the 
y-axis. Therefore, we defined a dimensionless resistance r= RCo, 
which is shown in Fig. 2a as a function of the gate potential U. For 
decreasing gate potential the weakly interacting Fermi gas (blue) shows 
a decrease of resistance reaching a minimum value of r ~ 35 for zero 
gate potential. For high gate potentials the resistance for both inter- 
action strengths are comparable, yet the strongly interacting gas (red) 
showed a much faster drop of resistance below 0.7 1K. At a gate poten- 
tial of 0.23(2) UK the resistance differed by a factor of about 25 from the 
weakly interacting gas. As r approaches unity (below 0.23 uK) the 
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Figure 2 | Conduction properties through the channel. Red and blue data 
points correspond to the strongly and weakly interacting gas, respectively. 

a, Dimensionless resistance r as a function of gate potential U. The data points 
shown are those for which the decay is exponential. The inset to a shows a decay 
of the relative atom number imbalance between source and drain as a function 
of time with a gate potential U = 525(50) nK, where N is the total number of 
atoms and AN is the difference in atom number. The solid lines are exponential 
fits with fixed offset of 0.04 for the red curve to account for a small remaining 
imbalance in the reservoirs. b, Atom current as a function of the gate potential 
U. A large increase of the current appears for the strongly interacting gas 
below U ~ 0.7 uK. The inset to b shows the atom current in logarithmic 
scale. The shaded region indicates the maximum current allowed by the 
internal dynamics of the reservoirs (see main text). The error bars show the 
statistical errors. 
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decay time t became equal to the timescale of the internal dynamics 
of the reservoirs, set by the trap frequency along the y-direction. In this 
regime, we cannot interpret our strongly interacting data sets in terms 
of a resistance measurement because the reservoirs do not remain in 
thermal equilibrium at each point in time, that is, the resistance drops 
below our measurement capabilities. This gives rise to deviations from 
the exponential decay. 

In addition to the resistance, we also estimated the current through 
the channel using a linear fit to the initial part of the decay (see 
Methods). This measurement does not rely on the thermalization of 
the reservoirs and thus can also be applied to cases where the reservoirs 
are not fully in quasi-thermal equilibrium. Figure 2b shows the current 
Tas a function of the gate potential for the strongly interacting gas (red) 
and the weakly interacting gas (blue). Unlike the weakly interacting 
gas, the strongly interacting gas showed a fast increase of the current 
below 0.7 1K. For the lowest gate potentials the current was limited by 
the conservation of energy. The limit was reached when the potential 
energy introduced by the initial imbalance was fully converted into 
kinetic energy, as for example in undamped dipole oscillations. It is 
represented by the shaded region in the inset to Fig. 2b, where we show 
the current in logarithmic scale. Remarkably, the observed current was 
very close to that limit, meaning that the strongly interacting Fermi gas 
flowed as if there were no constriction or gate potential at all. This is the 
expected behaviour of a superfluid. 

Although the current depends on the atomic density in the channel, 
the transport properties are characterized in a density-independent 
way by the drift velocity. To extract this quantity, we first used high- 
resolution in situ imaging to measure the atomic line density mj. in 
the channel. The measured line density as a function of the gate poten- 
tial is shown in the inset to Fig. 3. As expected from its higher com- 
pressibility*”°, the strongly interacting gas reached larger line densities. 
For each value of the gate potential, we then divided the measured 
current by the corresponding line density, yielding the drift velocity. 

The drift velocities as a function of gate potential are presented in 
Fig. 3. The drift velocity for the weakly interacting gas showed almost 
no variations. In contrast, the drift velocity for the strongly interacting 
gas increased significantly below U = 0.7 uK. This demonstrates that 
the large increase of the current, seen in Fig. 2, was not simply caused 
by the higher density of the strongly interacting gas in the channel and 
reveals a change in the nature of the transport process. It cannot be 
explained by a transition of the gas from ballistic to classical hydro- 
dynamic behaviour because even for large gate potentials the mean free 
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Figure 3 | Density-independent conduction properties through the 
channel. Red and blue data points correspond to strongly interacting and 
weakly interacting atoms, respectively. Drift velocity is plotted as a function 
of gate potential. The points corresponding to the three highest values of the 
gate potential are omitted in the weakly interacting case because the density is 
almost zero. The inset shows line density 1;,- measured in situ in the channel as 
a function of gate potential (see Methods). The error bars represent statistical 
errors (one standard deviation). 
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path remains well below the channel size. On the other hand, at low 
gate potentials, that is, at low T/T, where T; is the Fermi temperature, 
Pauli blocking of interparticle collisions in a normal gas should restore 
the ballistic behaviour*’** (see Methods). This is in contrast to our 
observations, supporting our superfluidity interpretation. Whether 
another mechanism can lead to better conduction properties than that 
of a perfect ballistic conductor is unclear. 

It was instructive to compare the drift velocity to characteristic 
velocities of the superfluid flow. A Landau-type critical velocity pro- 
vides a value which is expected to be of the order of the Fermi velocity 
for strongly interacting fermions*. We estimated the Fermi velocity 
in the channel from the column density at zero gate potential, which 
gives about 50mms_’, twice as large as the measured drift velocity. 
Additionally, the creation of vortices in the fluid provided a lower 
critical velocity, giving rise to a finite resistance. This velocity can 
be roughly estimated from the channel geometry and the healing 
length using energetic arguments'’, and yielded about 5mms '. 
The observed drift velocity was significantly larger than this critical 
value. This would explain the low but finite resistance observed even in 
the superfluid state, where the decay of the number imbalance is fast 
but remains exponential. 

We next related the conduction properties to a thermodynamic 
parameter by replacing the gate potential scale, which is specific to 
our system, by the thermodynamic potential. To this end, we used the 
high-resolution images of the gas in the channel, which gave us access 
to the equation of state**. The gas in the channel is in the crossover 
regime between two and three dimensions, where the equation of state 
naturally relates the column density n,, to the chemical potential**”° 
(see Methods). From the in situ absorption images of the channel for 
different gate potentials we obtained n..)(U) at fixed temperature, 
which is imposed by the reservoirs. Integrating this relation over the 
known variations of the gate potential yielded the thermodynamic 


potential Q(U) = f Neoi(V)dV which would be equal to the pressure 
U 


in a purely two-dimensional gas. We normalized Q by the pressure of 
a two-dimensional ideal Fermi gas at zero temperature Q9 = mh’ n2,, /m 
and obtained a model-independent thermodynamic scale, analogous to 
the three-dimensional situation discussed in ref. 8. This allowed us to 
convert the gate potential into a thermodynamic quantity, even though 
the gas in the channel was not expected to be in the universal regime* 
owing to the strong confinement’, where most of the thermometry 
techniques cannot be applied directly”*. 

The drift velocity as a function of reduced thermodynamic potential 
is shown in Fig. 4. The strongly interacting gas (red) show a pro- 
nounced increase of drift velocity below Q/Qp ~ 1, indicating the onset 
of superfluidity. This illustrates the high sensitivity of transport mea- 
surements to many-body effects in strongly correlated quantum gases. 
For higher /Qp the blue and red data sets show a constant drift 
velocity. The inset to Fig. 4b presents the resistance as a function of 
Q/Qy for the strongly interacting Fermi gas. Here, we observed a very 
rapid decrease of the resistance for low values of 2/Qo. We interpret 
this as the counterpart of the drop of resistance observed in super- 
conductors. Measurements of the equation of state of a unitary Fermi 
gas in three dimensions have shown that the transition takes place for a 
critical reduced thermodynamic potential of 0.55 (ref. 8). Even though 
our channel is in the crossover between two and three dimensions, we 
observed the change in the conduction properties at around the value 
of the two-dimensional reduced thermodynamic potential (black 
dashed lines in Fig. 4). 

Our experimental geometry is reminiscent of weak links in super- 
conductors’ and the experiment probes transport in a channel that 
is long compared to the coherence length. The coherence length of a 
strongly interacting superfluid is of the order of the interparticle spa- 
cing*, which is below a few micrometres in the channel and smaller 
elsewhere. The length and energy scales of our experiment mean that 
we operate in a dissipative regime complementary to the coherent 
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Figure 4 | Conduction properties as a function of thermodynamic potential. 
Drift velocity is plotted as a function of the reduced thermodynamic potential 
Q/Qo for the strongly interacting (red) and weakly interacting (blue) Fermi gas. 
The inset shows dimensionless resistance as a function of Q/Q, in logarithmic 
scale for the strongly interacting gas, showing the drop of resistance. The 
dashed black lines at Q/Qo = 0.55 indicate the position where the superfluid 
transition in three dimensions occurs. Error bars represent statistical errors 
(one standard deviation). 


tunnelling encountered in Josephson junctions*””*. Our set-up allows 
the investigation of superfluidity and supercurrents in a variety of 
configurations by projecting a designed potential through the micro- 
scope onto the channel”. This opens the way towards the cold-atom 
modelling of complex, superconducting devices. 


METHODS SUMMARY 


A balanced mixture of the two lowest hyperfine states of °Li is prepared by all- 
optical evaporation. Final temperatures are <0.1Tp (strongly interacting gas, 
6.7 X 10* atoms) and approximately 0.37; (weakly interacting gas, 4.5 X 10* 
atoms). For the strongly interacting gas the evaporation is performed at a magnetic 
field of 795 G (scattering length 3,500dp, where dp is the Bohr radius), then the field 
is adiabatically ramped to 834G, at the s-wave Feshbach resonance. The weakly 
interacting gas is cooled at 300G, then the field is ramped to 475 G (scattering 
length -100a9). The trap frequency along the y-axis is w, = 2m X 32(1) s' and 
@y = 2m X 25(1) s | for the strongly and weakly interacting gases, respectively. To 
induce an atom current, we create a number imbalance between the two reservoirs 
by shifting the trapping potential along the y-direction with a magnetic field 
gradient of 0.25 Gcm *. After switching off the gradient within 10 ms, we monitor 
the decay of the number imbalance. The number imbalance and the total atom 
number are obtained from absorption images along the x-axis. For all data, we fit a 
line to the first four points of a measured decay curve of the relative number 
imbalance. We define the current as the fitted slope times half the total number 
of atoms in both reservoirs at equilibrium. To measure the column density 1,,, as 
well as the line density at the centre of the channel for different gate potentials, we 
take in situ absorption images of the channel through the high-resolution micro- 
scope in the absence of current. We apply light pulses of 5 1s and a saturation 
of about 0.1. The local density approximation gives the equation of state’ 
Neoi(Ho — V), where [lo is the chemical potential imposed by the reservoirs and 
V is the local gate potential. Integrating this equation over the gate potential leads 
to the thermodynamic potential. 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 


Cloud preparation. A quantum degenerate Fermi gas is prepared by all-optical 
evaporation of a balanced mixture of the two lowest hyperfine states of °Li. 
Evaporation is performed at a magnetic field of 795 G (where the scattering length 
is 3,500dp, and do is the Bohr radius) down to a trap depth of 880 nK. This produces 
a Bose-Einstein condensate of molecules. Then the trap depth is increased to 
2.6 UK in order to stop the evaporation, and the magnetic field is adiabatically 
ramped up to 834 G, where the broad s-wave Feshbach resonance is positioned. 
The curvature of this Feshbach field sets the trap frequency along the y-axis at 
(@®y = 2m X 32(1) Hz. We obtain a strongly interacting Fermi gas of about 6.7 X 10* 
atoms with a temperature T<0.1T , where Ty is the Fermi temperature”’. We 
determine the chemical potential (11 ~ 0.8 1K) of the strongly interacting gas by 
measuring the size of the cloud in the trap. The weakly interacting Fermi gas is 
prepared using the same evaporation ramp at a magnetic field of 300G. The 
magnetic field is then ramped up adiabatically to 475 G (w, = 2m X 25(1) Hz) 
where the scattering length is —100a9. This yields atom numbers of about 
4.5 X 10* at T~ 0.37. We keep the scattering length at a small but finite value 
to ensure that the reservoirs remain at equilibrium during the measurement’’. 
Current generation and measurement. During evaporative cooling we create a 
number imbalance between the two reservoirs by having the trapping potential 
shifted along the y-direction, away from the centre position of the channel. The 
shift is created using a magnetic field gradient of 0.25Gcm™ along the y-axis. 
Restoring the symmetry of the potential in 10 ms creates an atom number imbal- 
ance in the symmetric trapping configuration. This leads to a potential imbalance, 
inducing the atom current. To infer the atom number imbalance, as well as the total 
atom number, the number of atoms in each reservoir is measured by absorption 
imaging along the x-axis. This is done for variable time delays. Each measurement 
is repeated three times and averaged to reduce the noise. In addition to the expo- 
nential fit, we fit a line to the first four points of a measured decay curve of the 
relative number imbalance. We define the current J as the fitted slope multiplied by 
half the total atom number in both reservoirs at equilibrium. For the case where the 
decay is exponential we checked that fitting a line and an exponential gives the 
same current within the error bars. 

Equilibrium density of the gas. In the absence of current, we take in situ absorp- 
tion images of the cloud through the high-resolution microscope. We use light 
pulses of 5 1s with an intensity of about 0.1 of the saturation intensity. We extract 
the line density of the cloud by counting the total number of atoms in a region of 
18 1m along the y-axis at the centre of the channel, over which the trap frequency 
along the z-axis varies by less than 10%. The variations of column density along the 
x-axis are measured by counting the atom number in patches of length 18 pm in 
the y-direction, and 2.4 ttm in the x-direction. From the known waist of the dipole 
trap—22(1) um—we infer that the change of chemical potential within one of 
those patches is lower than 13.7%. All in situ pictures are averaged 20 times to 
reduce the noise. In addition, the gate beam profile is directly imaged through the 
same optical system, yielding a map of the gate potential. 

Thermodynamic potential. For each power setting of the gate beam, the in situ 
column density along the x-axis is processed in seven patches to yield a set of 


curves M<o(V), where V is the local gate potential in the corresponding patch. In 
the local density approximation, these curves belong to the same equation of state 
(because the confinement along the z-axis is the same in all patches). The curves 
are combined using the hypothesis that regions having the same column density 
have the same chemical potential, giving the equation of state’ Meoi(Lo — V). Here 
Ho is the unknown chemical potential imposed by the reservoirs. Integrating this 
equation of state from V to the largest gate potential (for which density is zero) 
gives the thermodynamic potential as a function of U for a fixed (but unknown) 
temperature. By normalizing the thermodynamic potential to that of an ideal 
two-dimensional Fermi gas with the same column density, we obtain the ther- 
modynamic scale that is used for Fig. 4. 

Confinement-dominated regime in the channel. Inside the reservoirs, the size of 
the superfluid pairs on the Feshbach resonance is 2.6/kp ~ 0.6 ttm (ref. 30). This 
length scale is of the order of the size of the ground state of the harmonic oscillator 
for atoms in the channel, \//i/(ma,)~0.8 jum. Therefore, even for the lowest gate 
potentials, we expect the pairing mechanism in the channel to be influenced by the 
confinement. As the gate potential is increased, the density in the channel 
decreases, so the expected pair size, being inversely proportional to the Fermi 
wavevector on the Feshbach resonance, increases, and the gas acquires a more 
and more pronounced two-dimensional character. 

Hydrodynamic behaviour of the strongly interacting Fermi gas. We estimate 
the mean free path between collisions for the gas at a magnetic field of 834G 
and compare it to the length of the channel to evaluate the hydrodynamic char- 
acter of the strongly interacting gas. We first consider the limit of low density, 
that is, large T/T,, at high gate potential. Using a two-dimensional ansatz for 
the gas at high gate potential and following ref. 26, we estimate the collision 
rate Q=hngp|f|"/m from the scattering amplitude f, which depends only on 
the two-dimensional density nzp (via the Fermi energy) and the confinement 
when the three-dimensional scattering length diverges”. The mean free path is 
given by the Fermi velocity divided by the collision rate, yielding /<2 jm for 
Np > 0.01 wm”. With our channel length of around 20 jum, the gas is hydrodyn- 
amic down to the lowest observed densities. In the opposite limit of low gate 
potentials and high density, Pauli blocking of collisions is expected to increase 
the mean free path in a classical hydrodynamic gas, eventually making the gas 
ballistic. We assume the gas to be in the three-dimensional regime, which yields 
the unitarity-limited scattering cross-section given by o=4n/kg, with kp the 
Fermi wavevector. The mean free path is given by != 1/on3p, with m3p the 
three-dimensional density. This yields /~ 1 jum for n3p~2 jm ~?. Pauli blocking, 
however, reduces the scattering cross-section propotional to (T/ Ty)? = 0.01 
(ref. 31) for our case, leading to a mean free path of the order of the channel size 
or even larger. 


30. Schunck, C. H., Shin, Y.-i., Schirotzek, A. & Ketterle, W. Determination of the 
fermion pair size in a resonantly interacting superfluid. Nature 454, 739-743 
(2008). 
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A canonical stability-elasticity relationship verified 
for one million face-centred-cubic structures 


Sascha B. Maisel!, Michaela Hofler’ & Stefan Miiller! 


Any thermodynamically stable or metastable phase corresponds to 
a local minimum of a potentially very complicated energy land- 
scape. But however complex the crystal might be, this energy land- 
scape is of parabolic shape near its minima. Roughly speaking, the 
depth of this energy well with respect to some reference level deter- 
mines the thermodynamic stability of the system, and the steepness 
of the parabola near its minimum determines the system’s elastic 
properties. Although changing alloying elements and their concen- 
trations in a given material to enhance certain properties dates 
back to the Bronze Age’”, the systematic search for desirable prop- 
erties in metastable atomic configurations at a fixed stoichiometry 
is a very recent tool in materials design’. Here we demonstrate, 
using first-principles studies of four binary alloy systems, that 
the elastic properties of face-centred-cubic intermetallic com- 
pounds obey certain rules. We reach two conclusions based on 
calculations on a huge subset of the face-centred-cubic configura- 
tion space. First, the stiffness and the heat of formation are nega- 
tively correlated with a nearly constant Spearman correlation‘ for 
all concentrations. Second, the averaged stiffness of metastable 
configurations at a fixed concentration decays linearly with their 
distance to the ground-state line (the phase diagram of an alloy at 
zero Kelvin). We hope that our methods will help to simplify the 
quest for new materials with optimal properties from the vast 
configuration space available. 

To the best of our knowledge, the only publication that successfully 
presented a full configurational optimization as a tool to systematically 
search for metastable phases harder than their respective ground states 
was by Yuge’. The author reported metastable boron-carbon nitride 
configurations with bulk moduli almost as high as that of pure dia- 
mond. Because both heat of formation and bulk modulus are functions 
of the carbon concentration, the author concluded that a very natural 
correlation existed between heat of formation and elastic properties— 
ultimately, this was a result of the dependence of both quantities on the 
concentration. Does any correlation remain if we restrain ourselves to 
one specific concentration? We shall make two claims and verify them 
on a subset of more than one million face-centred-cubic (f.c.c.) struc- 
tures. These structures have been sampled from the configuration 
space of four characteristic binary f.c.c. intermetallics, all of them tech- 
nologically relevant materials for a multitude of applications: nickel- 
aluminium near the Ni;Al stoichiometry (the primary precipitating 
phase constituting the y’-phase in nickel-based superalloys), Ni-Ta 
(another vital precipitor in Ni-rich alloys), Cu-Al (a very common 
light-weight stainless alloy) and nickel-rich Ni-W (a high-temperature 
alloy for extreme conditions). These four are characteristic of the 
family of the f.c.c. binary alloys. However, our approach could readily 
be used to test both claims in systems with covalent’ or ionic bonds, or 
in systems that do not crystallize on the f.c.c. lattice. 

Claim I is as follows: for any subset X,, of the f.c.c. configuration 
space with constant concentration x = const, the heat of formation 
AH(c) of structure o and its elastic stiffness Cj are negatively corre- 
lated. Specifically, for all structures ¢ € X,, both averaged tetragonal 
stiffness C77 and shear stiffness Cjq are almost monotonic functions of 


the enthalpy excess f/ = (AH(y) — AH(a))/(AH(y)) of structure o with 
respect to the structure with lowest AH at the respective concentra- 
tion. The dimensionless enthalpy excess / is a very useful quantity in 
our case, where several alloys have been studied and the actual values 
for AH vary (see Supplementary Information) between 0.43 meV (y'- 
Ni3Al) and 10 meV (for CuAl) per atom. Hence, expressing the elastic 
properties as functions of f instead of AH allows for a direct compar- 
ison of the correlation in the four investigated intermetallics. The 
claimed correlation can be quantified via the Spearman correlation* 
coefficient, p. This quantity takes on values between —l1=p=1, 
where a Spearman correlation of p(a, b) = 1 implies that a(b) is a 
strictly monotonic increasing function of b with da(b)/db > 0 and vice 
versa. On the other hand, a Spearman correlation of p(a, b) =-1 
implies a strictly monotonic decrease (anti-correlation) with éa(b)/ 
0b <0. Using Spearman’s p, claim I can be condensed into a single 
equation: p(f,Cji) <0. 

From a mathematician’s point of view, claim I states that the depth 
of the potential well is not independent of the derivatives of the energy 
parabolas near its local minima. The actual magnitude of p(f,q7) and 
p(B.ca4) will quantify the degree of correlation, the quality of mono- 
tony and (together with the number of structures in the subset X,) 
the likeliness of outliers. From a materials scientist’s point of view, a 
successful proof of claim I will immediately raise a plethora of ques- 
tions about the exact form of the correlation. Consider the problem of 
manufacturing a material with certain mechanical properties—a task 
required since the dawn of metal casting®. One would be bound to ask 
if the correlation severely restricts the usefulness of dipping into non- 
equilibrium states when trying to grow a very hard crystal. Also, the 
nature of the elasticity-stability relationship could be susceptible to 
exploitation if the task at hand is to soften a material. This has been 
successfully performed in several commercial titanium alloys**, which 
have been softened by inducing a metastable body-centred-cubic 
(b.c.c.) phase which is softer than the ground state. This effectively 
reduces the stiffness to the point where the alloy is soft enough to serve 
as a material for hip implants without damaging the adjacent bones’. 
This typically involves a lattice change from structures based on hexa- 
gonal close packing to b.c.c.-based B2-type structures, and thus a 
change of the underlying lattice*. Our findings here support the sug- 
gestion that a similar softening could be achieved without causing 
lattice changes. 

To verify claim I, we have exhaustively enumerated at least 1.7 X 10° 
f.c.c.-based configurations for each of the four alloys up to a unit cell size 
of 21 atoms. The enumeration has been performed using an algorithm 


Table 1 | Cross-validation scores S., for all 12 cluster expansions 


Score Alloy 

y'-NiAl Cu-rich CuAl NiTa NiW 
So(AH) (eV) 0.94103 026x103 252x10°% 592x103 
Sev(Ciz) (GPa) 1.71 5.71 2.34 3.21 
Scv(Caa) (GPa) 1.51 0.62 1.09 2.25 


Scores are quantitative measures of the predictive power of these expansions. 


1Hamburg University of Technology, Institute of Advanced Ceramics, DenickestraBe 15, 21073 Hamburg, Germany. 
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Figure 1 | Ground-state diagrams of binary Ni-W. a, b, Colour coding shows 
@i(a) and ¢44(b). Each point in the diagrams corresponds to one specific 

geometric arrangement o of atoms with given tungsten concentration on the 
lattice sites of a face-centred cubic-lattice. Their respective formation enthalpies 


presented in refs 9 and 10, and yields a grand total of 1,405,970 struc- 
tures. Then, density functional theory has been applied to a 
representative subset of configurations for each of the four systems. 
These calculations have been performed using the Vienna Ab-Initio 
Simulation package (VASP)'"”’; computational details and input para- 
meters can be found in Methods. Using the data sets of density func- 
tional energies, an extrapolation of the energies of the remaining 
enumerated structures has been performed using the cluster expansion 
technique, as realized in our UNCLE package’’. The cluster expan- 
sion approach uses an analytical decomposition of the many-body 
interaction inherent in any crystal. Any observable that is strictly a 
function of the atomic configuration o can be expanded into a sum over 
many-body correlation functions /7; times some expansion coefficients 
Jr; which has been proven'*"’. Because both the symmetrically averaged 
elastic moduli Gj and the heat of formation AH per atom are functions 
of the atomic structure o, the cluster expansion method is ideally suited 
to expand both the mechanical properties and the energetics: 


m(a)= S$ Jy e(0) (1) 
F 


tu(o) = Jf Meo) (2) 
F 
AH(o) = —xa(o)E, —x(0)Es+ )~ Jy Ip(a) (3) 
F 


where E, and Ep are the free energies per atom of the pure elements, of 
which structure o contains a fraction x, of component A and xg of 
component B. Using equations (1)-(3), a grand total of 12 expansions 
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and elastic properties are deduced from the cluster-expansions (equations (1)- 
(3)) and indicated on the y axis and the colour-coding, respectively. Similar 
diagrams for binary Ni-Ta, Cu-rich Cu-Al and y'-Ni-Al and can be found in 
Supplementary Information. 


have been carried out. The average deviation calculated using a cross- 
validation score S., for each expansion is given in Table 1. This score 
Scy is a measure of the predictive power of a cluster expansion”. 
Figure 1b shows a ground-state diagram of binary Ni-W, where the 
averaged shear stiffness Cyq has been colour-coded into the diagram. 
The very stiff structures depicted in yellowish colours are predomi- 
nantly found near the ground-state line, while the very soft structures 
are realized at high formation enthalpies and therefore at high enthalpy 
excess f. Similar diagrams can be found for both the tetragonal 
modulus and the shear stiffness for all four intermetallic systems; see 
Supplementary Information. 

The data set used to generate Fig. 1 can be used to verify claim I by 
directly calculating Spearman’s coefficient p([,¢j) for a large sample 
size (Fig. 2). As claimed, both p(f,ci1) and p(,ca4) are strictly nega- 
tive for all systems and all compositions. Hence, the stiffness is a 
(quasi) strictly decreasing function of the enthalpy excess f of the 
structures ¢ in X, over which ¢j(c) is sampled. We note that the 
correlation coefficient is specific for the material and the respective 
elastic property, and does not vary much with concentration. This 
behaviour appears to be generic, which suggests that Spearman’s p is 
a quantity characteristic for each ordered phase in intermetallic alloys. 
As Spearman’s p only quantifies the degree of correlation without 
actually fixing the functional relation, this immediately prompts the 
formulation of another claim. 

Claim II is as follows: let X,.. be the set of all f.c.c. structures with 
enthalpy excess B and x(o) = x. Let (Ca(a)) be the averaged shear 
modulus averaged over the set Xp, ,. Then, the harmonic (Voigt-type) 
average 
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Figure 2 | Spearman’s coefficients p(f,c11) and p(f,c44) for the correlation 
between stiffness ¢jj and enthalpy excess £ a, p( C11); b, p(B,Caq). The x axis 
shows concentrations of Al in copper-rich Cu-Al (red crosses), Ta in binary Ni- 
Ta (green crosses), W in binary Ni-W, and Al in y’-Ni-Al. The correlation is 
strictly negative (anticorrelation) for all systems and compositions. Note that 
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Spearman’s p is nearly constant for a given material and choice of i. The 
constant fits have been determined by generating ten sets of interaction 
coefficients in equations (1)-(3) for each system with subsequent calculation of 
the arithmetic mean. 
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Table 2 | Decay of stiffness as a function of enthalpy excess f for five face-centred cubic alloys 


Structures and fits Alloy 

y'-NizsAlas NigoTazo0 Cug3.aAlies CugoAlzo NigoW20 
Structures 29,649 11,500 1,474 8,763 9,753 
Fit residual (C77) (GPa) 0.841 2.478 4.352 2.661 2.79 
Fit residual (Caq) (GPa) 0.433 0.958 0.568 0.470 0.485 
6(C71) / OB (GPa) —76.76 + 0.07 —34.56 + 0.08 —50.04 + 0.69 —38.92+0.17 —24.72 + 0.06 
0(Caa) / 0B (GPa) —16.09 + 0.04 —61.00 + 0.05 -—19.56+0.17 —25.15+0.05 —19.05 + 0.03 


Stiffness is given by @(¢j;)/0B, and corresponds to the slopes of the linear fits in Fig. 4. Harmonic averaging has been used for the thermal averaging denoted by ). 


(Caa) (B,x) = a“ 


Csa() 
over N different structures will decrease approximately linearly as a 
function of the enthalpy excess f in the vicinity of the ground-state line 
0=f<1.A similar functional relation will hold for the average of 
11 (B,x). The thermal averaging denoted by () is crucial, because at 
finite temperatures, all structures with equal enthalpy excess f (p) = 
const are in coequal competition. 

To validate claim II, we have analysed (17) (, x) and (C44) (f, x) for 
five different f.c.c. systems (Table 2). The five data sets used for the 
averaging process have been generated from the previous cluster 
expansions. Thus, the five systems correspond to five columns at speci- 
fic concentrations in the diagrams of Fig. 1 and Supplementary Fig. 1. 
To illustrate this, Fig. 3 shows Cag (f, x) for NiggW 9 and Ni;Al before 
the thermal averaging. 

Figure 4 shows the results of averaging over all structures at constant 
fh according to equation (4) for the five compositions in Table 2. 
According to claim II, the decay of 11(f) and C44(f) as a function 
of the enthalpy excess f is roughly linear for each of the five systems 
and Fig. 4 clearly confirms that. As a quantitative confirmation, fit 
residuals are supplied in Table 2, and are typically only a few GPa. 

From an application-oriented perspective, the successful verifica- 
tion of claim II allows us to predict the odds of finding the desired 
properties in structures with given enthalpy excess f. Touching again 
on the example of the Ti-based hip joints mentioned above—the shape 
of Fig. 4 suggests that more in-depth research on the relationships 
presented here will allow materials to be tuned to the desired stiffness 
in similar circumstances, where an application requires a material with 
exactly the right elastic properties. The verification of claim II also 
establishes that looking for very stiff metastable phases should always 
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Figure 3 | Gq asa function of the enthalpy excess # before averaging over all 
structures at constant # Each coloured dot corresponds to one real-space 
structure. All structures with fixed enthalpy excess are in coequal competition. 
Roughly 1,500 structures out of a Nig9W29 subset (red squares) of the f.c.c. 
configuration space are shown, together with 35,000 structures out of a Ni7;Als 
subset (green circles) of that configuration space. 
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commence at the ground-state line. With increasing f > 0, it will be 
less and less rewarding to search for atomic configurations with a high 
stiffness—the steepest parabolas and hence the stiffest specimens are 
encountered in the stable and nearly stable phases. 

Although all alloy compositions show a more or less linear decay of 
(i) (B,x) and (C4) (f,x), the slopes 0¢j;/0f vary significantly from 
material to material. Most notably, the shear stiffness C4q of the binary 
Ni-Ta alloy decreases rapidly away from the ground-state line: the 
044 /0f values for Ni-Ta are the highest of all the materials we inves- 
tigated. From a materials scientist’s perspective, this corresponds to a 
system which loses its mechanical stability quickly, if the crystal struc- 
ture is not near its thermodynamic equilibrium. Assuming that the 
proposed linear behaviour is generic, systematic tabulation of 0¢;/0f 
for different systems appears to be a worthwhile endeavour for both 
crystallographic and application-driven purposes. 
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Figure 4| Averaged stiffnesses using harmonic (Voigt-type) averaging for 
four different systems at five different compositions. a, (¢11)(,x); 

b, (Gia) (f.x). Key shows systems and compositions examined. Note that a 
change of composition from the CugpAl alloy to the stiffer Cug3 4Ali6.6 merely 
leads to a nearly parallel shift of the stiffness curves, without drastically 
changing the functional relation or the slopes. 
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In this work, we have offered a systematic large-scale analysis of the 
elasticity-enthalpy dependence for f.c.c.-based metal-metal alloys 
only. However, the methods that we have used to demonstrate these 
correlations and relationships could be used to investigate whether 
these claims also hold for other systems. Several indications of a con- 
figuration dependence of, for example, bulk moduli have been given in 
the literature for practically all classes of materials—not only inter- 
metallic alloys'*’”, but also glasses'® and even carbides’. Further work 
is required to verify or falsify our findings for other base lattices and 
different classes of materials. 


METHODS SUMMARY 


The four Hamiltonians have been generated using the cluster-expansion approach. 
The method is introduced elsewhere'*”” and the actual implementation used is the 
UNCLE package"*. The basis set of the expansion consists of real-space correlation 
functions. The expansion coefficients of the four different cluster expansions have 
been obtained by fitting to density functional theory input calculations, employing 
an evolutionary approach using genetic algorithms’’”®. The first-principles input 
required by the genetic algorithms has been provided using the VASP package"""””. 
The exchange-correlation functional of the DFT calculations was described within 
the general gradient approximation as parameterized in ref. 21. Specifically, the 
PBE-parameterized PAW-GGA-potentials*’” as supplied with VASP have been 
employed. All structures have been geometrically fully relaxed. Integration 
within the Brillouin zone has been performed using I’-centred k-meshes with 
up to 23 X 23 X 23 grid points depending on supercell size, employing the usual 
Methfessel-Paxton smearing” for the relaxations and the Bléchl-corrected tetra- 
hedron method” to determine the energies. After fully relaxed geometries have 
been obtained, the elastic properties were determined by applying the finite dis- 
tortions formalism of VASP using the four-displacement scheme (NFREE = 4) at 
a spacing of 0.018 A. As this procedure requires absolute convergence of the stress 
tensor, convergence tests of the plane-wave energy cut-offs had to be performed for 
all relaxations and distortions. At their conclusion, increased cut-offs of 420 eV 
were chosen for all systems. The concentration ranges of enumerated structures 
using the algorithm introduced in refs 9 and 10 vary from system to system 
(Supplementary Fig. 5). This is because we took particular care to have all struc- 
tures in all systems and at all compositions reside on an f.c.c.-based lattice in one 
single phase, since for some systems, especially NiTa (ref. 25), the elastic properties 
vary significantly between neighbouring phases—an effect that is potentially stronger 
than the decay of stiffness away from equilibrium. 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 


The four Hamiltonians have been generated using the cluster-expansion approach. 
The method is introduced elsewhere'*”” and the actual implementation used is the 
UNCLE package". The basis set of the expansion consists of real-space correlation 
functions. The expansion coefficients of the four different cluster expansions have 
been obtained by fitting the J; in equations (1) to (3) to density functional theory 
input calculations, employing an evolutionary approach using genetic algo- 
rithms'*”*. This evolutionary formalism to construct the expansions also fully 
optimizes the basis set'*, that is, the correlation functions used to parameterized 
the observables c);, c44 and AH. For all systems and all observables, less than 30 
n-point correlation functions were required to obtain a stable expansion, with no 
higher order than 6-point correlations occurring. A total number of 50 structures 
for Ni-W, 55 for Ni-Ta, 88 for Ni-Al and 27 for CuAl have been used for the 
input data base, yielding a grand total of 220 structures from which the cluster- 
expansion Hamiltonians have been constructed. Separate expansions have been 
performed for c,;, ¢4and AH. The cross validation scores of the twelve expansions 
can be found in Table 1. The first-principles input required by the genetic algo- 
rithms has been provided using the VASP package'*. The exchange-correlation 
functional of the DFT calculations was described within the general gradient 
approximation as parameterized in ref. 21. Specifically, the PBE-parameterized 
PAW-GGA-potentials”’”* as supplied with VASP have been employed. For the 
representation of tantalum, a PAW-potential including full treatment of p-states 


has been employed. All structures have been geometrically fully relaxed. Inte- 
gration within the Brillouin zone has been performed using I'-centred k-meshes 
with up to 23 X 23 X 23 grid points depending on supercell size, employing the 
usual Methfessel-Paxton smearing” for the relaxations and the Bléchl-corrected 
tetrahedron method™ to determine the energies. After fully relaxed geometries 
have been obtained, the elastic properties were determined by applying the finite 
distortions formalism of VASP using the four-displacement scheme (NFREE = 4) 
at a spacing of 0.018 A. Since this procedure requires absolute convergence of the 
stress tensor, convergence tests of the plane-wave energy cut-offs had to be per- 
formed for all relaxations and distortions. At their conclusion, increased cut-offs 
of 420 eV were chosen for all systems. The concentration ranges of enumerated 
structures using the algorithm introduced in refs 9 and 10 vary from system to 
system (see Supplementary Fig. 5). This is because we took particular care to have 
all structures in all systems and at all compositions reside on an f.c.c.-based lattice 
in one single phase, since for some systems, especially NiTa (ref. 25), the elastic 
properties vary significantly between neighbouring phases—an effect that is 
potentially stronger than the decay of stiffness away from equilibrium. The 
post-processing of the data to obtain Spearman’s p was done using GNU 
Octave scripts, using the function call ‘spearman()’ as implemented in Octave 
version 3.2.4. The linear fitting in Fig. 4 and the calculation of the slopes was 
performed using classical least-squares minimization. To this end, GNU Gnuplot 
v4.4 has been employed. 
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Rapid coupling between ice volume and polar 
temperature over the past 150,000 years 
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Current global warming necessitates a detailed understanding of 
the relationships between climate and global ice volume. Highly 
resolved and continuous sea-level records are essential for quantify- 
ing ice-volume changes. However, an unbiased study of the timing of 
past ice-volume changes, relative to polar climate change, has so far 
been impossible because available sea-level records either were dated 
by using orbital tuning or ice-core timescales, or were discontinuous 
in time. Here we present an independent dating of a continuous, 
high-resolution sea-level record’? in millennial-scale detail through- 
out the past 150,000 years. We find that the timing of ice-volume 
fluctuations agrees well with that of variations in Antarctic climate 
and especially Greenland climate. Amplitudes of ice-volume fluctua- 
tions more closely match Antarctic (rather than Greenland) climate 
changes. Polar climate and ice-volume changes, and their rates of 
change, are found to covary within centennial response times. 
Finally, rates of sea-level rise reached at least 1.2 m per century dur- 
ing all major episodes of ice-volume reduction. 

During the past few million years, variability in global ice volume 
(sea level) has been one of the main feedback mechanisms in climate 
change (see, for example, refs 3, 4). However, detailed assessment of 
the role of ice volume in climate change is hindered by inadequacies in 
sea-level records and/or their chronologies. First, dated coral sea-level 
benchmarks are discontinuous before the last glacial maximum (LGM; 
~22,000 years ago). Second, continuous sea-level records have insuf- 
ficient chronological control; they rely on orbital tuning, correlations 
with ice-core records, or imperfect transfer of coral datings'**’. 
Orbital tuning assumes a systematic response between changes in ice 
volume and Earth’s orbital parameters, so that the relationship 
between insolation forcing and global ice volume cannot be discerned 
from orbitally tuned records. In addition, the timing of any centennial- 
scale to millennial-scale fluctuations in ice volume will be poorly con- 
strained in orbitally tuned sea-level records because the shortest orbital 
frequency is ~ 19,000 years. Transferring an ice-core chronology to a 
sea-level record requires an assumption that ice volume always varies 
in a systematic phase relationship with either Antarctic or Greenland 
climate, which may not be the case (see, for example, refs 1, 2, 8, 9). 

We resolve these issues for the past 150,000 years using a novel 
approach to provide a detailed chronology to the continuous and 
highly resolved record of Red Sea relative sea-level (RSL)*. We exploit 
a ‘basin isolation’ concept, similar to that used for the Red Sea’”, in the 
nearby eastern Mediterranean, where marine sediments can be dated 
much more accurately. Because the hydrological cycle directly links the 
5'%O of eastern Mediterranean surface waters and that of cave spe- 
leothems on bordering land masses downwind of this highly evaporat- 
ive sea’®"!, we can directly relate our new high-resolution planktonic 
foraminiferal 5'*O record for the surface-dwelling species Globi- 
gerinoides ruber (white form) in eastern Mediterranean sediment core 
LC21 (5'SO;uber) to the U-Th-dated Soreq Cave speleothem 8'°O 
record (8'°Ospeteo) (Fig. 1; Methods and Supplementary Information). 


Previous work demonstrated quantitatively that eastern Mediter- 
ranean 5'°O has a strong overprint of sea-level variability’; hence, 
considerable agreement is expected between 8'¥O signals for the Red 
Sea and the eastern Mediterranean (Supplementary Information). 
Although the more complicated hydrological cycle in the Mediter- 
ranean (relative to the Red Sea) means that variations in eastern 
Mediterranean 5'%O cannot be used to determine the amplitudes of 
sea-level change precisely, the basin isolation effect imposes sufficient 
5'°O signal similarity between the two seas to allow accurate transfer of 
the superior eastern Mediterranean chronology to the Red Sea record. 
This is achieved using our new 5'*O record from core LC21 for the 
subsurface-dwelling planktonic foraminifer Neogloboquadrina pachy- 
derma (dextral) (0 Opaals which is known to minimize surface-water 
5'%O overprints (Fig. 1, Methods and Supplementary Information). 

Construction of the new RSL chronology involves two stages. First, 
we build an age model for eastern Mediterranean core LC21 by cor- 
relating its 8° Ovuber record with the Soreq Cave OP Ovsina record over 
the interval 40-160 kyr Bp (Fig. la and Supplementary Information). 
For the interval 0-40 kyr Bp, our age model is constrained by 14 radio- 
carbon datings and two well-documented and independently dated 
tephra horizons from the Minoan’? and Campanian Ignimbrite 
(CI)** eruptions. A Bayesian depositional model (constructed using 
OxCal"®) comprising the original Soreq Cave chronology, the '*C dat- 
ings and the chronostratigraphic position of all Soreq—LC21 tie-points 
(Supplementary Information) then improves the accuracy of the Soreq 
Cave chronology and '*C datings, and consequently that of core LC21, 
and rigorously determines the chronological uncertainties of the LC21 
tie-points. Next, we transfer the new LC21 age model to the RSL record 
between 22,000 and 150,000 years ago using the 8'*O,,. record 
(Fig. 1b; Supplementary Information). For the younger interval (0- 
22,000 years ago), we correlate RSL with a recent sea-level probability 
curve based on radiometrically dated sea-level benchmarks’; this is a 
more direct correlation target than a @ eee for this interval (Fig. 1b). 
Our correlations reveal that Last Interglacial (LIG) sea levels peaked 
before the main (monsoonal) wet phase in the eastern Mediterranean 
(Fig. 1b). This is stratigraphically corroborated within Red Sea core 
KLO9, in which runoff-related soil biomarkers appear after the LIG 
highstand signal’’ (Fig. 1b). 

Age uncertainties are quantified for all correlations to allow full 
error propagation into the new RSL chronology. A root-mean-squares 
estimate at the 95% (2c) probability level is calculated that fully 
accounts for errors associated with sample-spacing in the ae eee 
8 Ones: O "Opes and RSL records, as well as the analytical error 
associated with the Soreq Cave U-Th and LC21 eC datings, and the 
20 confidence interval of the sea-level probability curve (Supplemen- 
tary Information). We reinforce this by categorizing our chosen tie- 
points into three levels of confidence: category 1 is considered the most 
reliable and within the bounds of sample-spacing, category 2 tie-points 
may be moved by +0.5kyr, and category3 tie-points are the most 
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Figure 1 | Correlation of Soreq Cave and eastern Mediterranean (LC21) 
510 signals and the Red Sea RSL record. a, New planktonic foraminiferal (G. 
ruber) 5'8Oyuber record from core LC21 (black), Soreq Cave 3" Ospeteo record 
(red) (Supplementary Information) and tie-points (red diamonds) used to 
correlate the LC21 and Soreq Cave records. Error bars (+0.5 and +1 kyr) 
denote more ambiguous tie-points (Supplementary Information). Also 
indicated are '*C datings (black diamonds), the Minoan and Campanian 
Ignimbrite (CI) tephra horizons (dashed black lines) and intervals of sapropel 
deposition (grey rectangles). b, Red Sea RSL record’ (dark blue, 1-kyr moving 


contentious and may be moved by +1 kyr (Fig. 1). On the basis of the 
total error of each RSL tie-point, we interpolate a 20 age uncertainty for 
every data point in the RSL record (Supplementary Information). 
Finally, these age uncertainties are combined with methodological 
sea-level uncertainties (+12 mat the 2¢ level’) in a probabilistic assess- 
ment of the RSL record (Fig. 2 and Supplementary Information). This 
reveals that, during the LIG, RSL at Hanish sill (gateway to the Red Sea) 
stood above 0 m at 126-130 or 120-133 kyr Bp (95% confidence limits 
to the maximum-probability RSL (RSLpmax) and RSL data points, 
respectively), and peaked at 127-129 or 126-132 kyr Bp (95% confid- 
ence limits to RSLpmax and RSL data points, respectively; Fig. 2). 
Although the depth of Hanish sill is implicit in the Red Sea sea-level 
method, the timing and magnitude of LIG sea levels in our RSL record 
may be expected to differ from eustatic sea level (ESL) as a result of 
isostatic effects at the sill’*; our datings are therefore likely to be refined 
by detailed isostatic modelling of the sill. 

We now compare our probabilistic sea-level curve with other key 
records of sea level and high-latitude climate (Fig. 2). Our new RSL 
record agrees well—within uncertainties—with coral sea-level bench- 
marks (Fig. 2b). Discrepancies in Marine Isotope Stages 5e and 5a may 
relate to uncertainties in coral position or depth habitat, tectonic/iso- 
static effects among sites, and/or isostatic effects at Hanish sill'®. 

In general, our sea-level record agrees well with variability in ice 
volume suggested by deep-sea benthic foraminiferal 5'°O, and with 
the major (orbital-scale) climate transitions recorded in Greenland 
and Antarctic ice cores (Fig. 2). Exceptions to this broad coherence 
are termination I in the benthic foraminiferal 5'°O record? of marine 
core MD95-2042 from the Iberian margin (Fig. 2d), and termination II 
and the Marine Isotope Stages 4-3 transition in a global benthic 6'°O 
stack"” (Fig. 2c). Given that RSL agrees well with all other proxy records 
over these transitions, we surmise that these offsets are due to orbital 
tuning, lower sample resolution’”” and potential bias from deep-sea 
temperature changes’ and isostatic effects. 


Gaussian filter) after correlation with the LC21 (N. pachyderma) 3"Opac record 
(green) and with a highest-probability sea-level curve’ (light blue). Correlation 
tie-points (green and light blue diamonds, with error bars as in a) and 
termination II (TI) are indicated. For a Mediterranean 8'°O (5'8OMeq) 
enrichment of 2.5 + 0.5%o per 120 m sea-level change"’, RSL was converted into 
equivalent 5'8Oxrea values (orange shading). A LIG wet phase in the Red Sea 
(about 124-128 kyr Bp; black bar) is also indicated’. RSL is not reliable for 
about 14-23 kyr Bp because of an aplanktonic zone in Red Sea sediments 
(Supplementary Information). 


Given that the eustatic glacial—interglacial sea-level range is impli- 
citly accounted for in the Red Sea sea-level method, RSL is a good 
approximation of variations in ESL (ice volume)', although it may 
underestimate ESL variability by as much as 10% (ref. 18). 
Regarding polar climate variations, the structure and amplitude of 
Antarctic climate variations agree well with the record of highest- 
probability ice-volume fluctuations (Fig. 2e). This corroborates pre- 
vious observations'*° but, crucially, is more conclusive because our 
new chronology is entirely independent of ice-core age models. The 
Antarctic-RSL relationship is most tenuous at about 95-115 kyr Bp, at 
which RSL instead agrees better with Greenland climate fluctuations. 
Indeed, the timing of ice-volume changes is generally found to be close 
to that of Greenland climate variability (Fig. 2f). However, higher- 
amplitude Greenland 5'*O oscillations (‘Dansgaard—Oeschger’ events”’) 
generally exceed concomitant ice-volume variability. In summary, at the 
maximum probability and 95% confidence levels for RSL, the timing and 
structure of large-scale sea-level variability reflects a global signature of 
climate changes recorded in both Antarctic and Greenland ice cores. 

Phase relationships between changes in polar climate and ice 
volume are initially evaluated by lagged correlations between the ice- 
core data (European Project for Ice Coring in Antarctica (EPICA) 
Dronning Maud Land (EDML) 3'8O and North Greenland Ice-core 
Project (NGRIP) 3/80) and RSL (Supplementary Information). We 
find that ice-volume lags of 100-400 and 200-400 years produce the 
best correlations with Antarctic and Greenland climate changes, 
respectively (Fig. 3a, b and Supplementary Information). Rates of 
change in ice volume and in polar climate correlate most strongly 
within +200 years (Fig. 3a, b). Further assessment with cross-spectral 
analyses of the EDML 3'80, NGRIP 5'0, and RSL records (and their 
derivatives) confirms that, for suborbital frequencies, peak coherences 
between Greenland climate and ice volume are associated with minimal 
phase offsets (+300 years), whereas phase offsets between Antarctic 
temperature and ice volume are potentially larger (400-700 years; 
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Figure 2 | Comparison of probabilistic 
assessment of RSL with other sea-level 
reconstructions and with Antarctic and 
Greenland climate variability. Confidence 
intervals of 95% for the RSL data (light grey) and 
probability maximum (dark grey) (Supplementary 
Information) are superimposed on: a, Red Sea RSL 
data on our new chronology (orange crosses; black 
line, 1 kyr moving Gaussian filter); b, coral sea-level 
data (+2¢) (blue, green’’, red”®, black’’); ¢, a 
global benthic foraminiferal 5/80 stack!? (pink); 
d, benthic foraminiferal 5'°O record (five-point 
running mean) from marine core MD95-2042 

L (ref. 5) (purple); e, 580 record (seven-point 
running mean) from EPICA Dronning Maud Land 
(EDML)”** (blue) and 5D record (seven-point 
running mean) from EPICA Dome C (EDC)”’ 
(orange); and f, NGRIP 8180 record (five-point 
running mean)*° (green). The MD95-2042 5'0 
record is plotted here on the Greenland Ice Core 
Chronology (GICC05 (ref. 31) for 0-60 kyr Bp, and 
on the NGRIP (2004) chronology”? for 60- 

85 kyr bP, after synchronizing the co-registered 

| (MD95-2042) planktonic foraminiferal 5'8O 
record with Greenland 5'%O variations. EDML 
5'8O and EDC 8D are plotted on the EDML1/ 
EDC3 timescale. NGRIP 5180 is plotted on the 
GICC05 timescale for 0-60 kyr Bp, and on its 
original timescale for 60-122 kyr Bp (ref. 30). RSL is 
less reliable for about 14-23 kyr BP because of poor 
sampling resolution through an aplanktonic 
interval, and is therefore not shown. Marine 
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Figure 3 | Lagged correlations of Antarctic and Greenland climate versus ice 
volume (sea-level), and rates of sea-level change over the last full glacial 
cycle. a, b, Regression coefficients (r) (Supplementary Information) are plotted 
for the highest-probability sea-level curve and the smoothed (s) EDML and 
NGRIP 5'80 records (RSLpmax» EDMLs, NGRIPs; black squares, left-hand 

y axes) and for their first derivatives (€RSLpmax/dt, dEDMLs/dt, dNGRIPs/dt; 
green squares; right-hand y axes) for the regressions EDMLs versus RSLpmax 
and dEDMLs/dt versus dRSLp,ax/dt (a) and NGRIPs versus RSLpyax and 
dNGRIPs/dt versus dRSLpmax/dt (b). Negative values of ice-volume lag 
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correspond to changes in ice volume leading changes in polar climate. 
Optimum correlations are indicated (black and green arrows). ¢, RSLpmax (grey 
shading), RSL data (blue crosses) and probability maximum of the first 
derivative of RSL (red) with 95% confidence interval (pink shading). Rates of 
sea-level change of +12 and —8mkyr ' are indicated (dashed lines). Red 
arrows mark peaks in sea-level rises of more than 12 mkyr ' at 10.9- 
11.8kyr BP, 37.4-37.5 kyr BP, 61.2-61.6 kyr BP, 85.5-86.9 kyr BP, 108.1- 

108.8 kyr BP and 132.1-133.8 kyr Bp. Data from the Red Sea aplanktonic interval 
(about 14-23 kyr BP) are omitted. 
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Supplementary Information). We infer that Greenland climate closely 
tracks and/or is directly coupled with ice-volume changes, whereas 
Antarctic climate variability may lead ice-volume changes by up to 
700 years (Supplementary Information). 

Our new RSL chronology permits the first robust calculation of rates of 
relative sea-level change throughout the past 150,000 years (Fig. 3c). This 
reveals that rates of sea-level rise reached at least 1.2 m per century during 
all major phases of ice-volume reduction, and were typically up to 0.7m 
per century (possibly higher, given the smoothing in our method) when 
sea-level exceeded 0 m during the LIG (Fig. 3c); the latter is consistent with 
independent estimates’. Rates of sea-level lowering rarely exceeded 
0.8m per century. Any differences between rates of change in ESL and 
RSL at Hanish Sill are likely to be captured within our uncertainties. 

We have characterized and dated a continuous record of ice-volume 
variability throughout the last glacial cycle in a manner that is entirely 
independent of assumptions about the orbital insolation forcing of 
climate or about glaciological processes. We have also shown that, 
on suborbital timescales, polar climate and ice-volume changes were 
closely coupled in a quasi-direct phase relationship (within centuries). 
Our analyses hint that Antarctic climate change leads global ice- 
volume change by several centuries, which is a realistic timescale for 
ice-sheet adjustments**. Greenland climate, however, is found to 
change virtually simultaneously with ice volume, which may suggest 
a link of Greenland temperature to ice-volume change in the Northern 
Hemisphere through albedo feedback. 


METHODS SUMMARY 


For the Soreq Cave record, we present 440 new U-Th datings that were acquired 
by multi-collector inductively coupled plasma mass spectrometry at the 
Geological Survey of Israel (Supplementary Information). Sample ages were cor- 
rected for detrital *°Th if *°Th/***Th activity ratios were less than 160; for 
*°Th/?"Th activity ratios of more than 160-200 the correction factor was well 
within age uncertainties. Typical age uncertainties (20) are less than 1 kyr (0- 
35 kyr sp), less than 1.5 kyr (35-70 kyr Bp), less than 2.5 kyr (70-120 kyr Bp) and 
less than 4kyr (120-160kyrsp). Dating methods are further described in 
Supplementary Information. We applied Bayesian age modelling’ to all U-Th 
datings (+2¢), which typically reduced uncertainties to less than 0.5kyr (0- 
60 kyr Bp), less than 1 kyr (60-90 kyr Bp) and less than 2 kyr (90-160 kyr Bp) and 
had only minor impacts on absolute ages (generally less than 250 and less than 
750 years for 0-70 and 70-160 kyr Bp, respectively). 

For the eastern Mediterranean record, continuous u-channel samples from the 
pristine archive half of core LC21 (35° 40’ N, 26° 35’ E; cruise MD81) were sub- 
sampled at 1-cm intervals. Stable isotope analyses of about 15-30 cleaned, similar- 
sized tests of G. ruber (white form) and N. pachyderma (dextral) from the greater 
than 300-11m and 150-300-|1m sieved sediment fractions, respectively, were per- 
formed at the National Oceanography Centre, Southampton, on a Europa 
Geo2020 mass spectrometer fitted with an individual acid-bath carbonate pre- 
paration line. Standards (NBS-19 and an in-house Carrara marble) were run every 
17 samples; external precision is less than 0.06%. 
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Development of teeth and jaws in the earliest jawed 


vertebrates 


Martin Riicklin', Philip C. J. Donoghue’, Zerina Johanson’, Kate Trinajstic?’, Federica Marone’ & Marco Stampanoni”’® 


Teeth and jaws constitute a model of the evolutionary devel- 
opmental biology concept of modularity’ and they have been con- 
sidered the key innovations underpinning a classic example of 
adaptive radiation’. However, their evolutionary origins are much 
debated. Placoderms comprise an extinct sister clade’ or grade** to 
the clade containing chondrichthyans and osteichthyans, and 
although they clearly possess jaws, previous studies have suggested 
that they lack teeth®*, that they possess convergently evolved 
tooth-like structures’"' or that they possess true teeth’?. Here 
we use synchrotron radiation X-ray tomographic microscopy 
(SRXTM)”* of a developmental series of Compagopiscis croucheri 
(Arthrodira) to show that placoderm jaws are composed of distinct 
cartilages and gnathal ossifications in both jaws, and a dermal 
element in the lower jaw. The gnathal ossification is a composite 
of distinct teeth that developed in succession, polarized along three 
distinct vectors, comparable to tooth families. The teeth are com- 
posed of dentine and bone, and show a distinct pulp cavity that is 
infilled centripetally as development proceeds. This pattern is 
repeated in other placoderms, but differs from the structure and 
development of tooth-like structures in the postbranchial lamina 
and dermal skeleton of Compagopiscis and other placoderms. We 
interpret this evidence to indicate that Compagopiscis and other 
arthrodires possessed teeth, but that tooth and jaw development 
was not developmentally or structurally integrated in placoderms. 
Teeth did not evolve convergently among the extant and extinct 
classes of early jawed vertebrates but, rather, successional teeth 
evolved within the gnathostome stem-lineage soon after the origin 
of jaws. The chimaeric developmental origin of this model of mod- 
ularity reflects the distinct evolutionary origins of teeth and of 
component elements of the jaws. 

Possible scenarios for the evolutionary origin of teeth and jaws have 
been influenced heavily by chondrichthyans, in which teeth develop 
within a deep dental lamina, producing files of replacements pre- 
formed long ahead of their functional deployment®. However, chon- 
drichthyans are not primitive jawed vertebrates* but, along with 
osteichthyans, represent crown-group gnathostomes. Therefore, the 
pattern of tooth development and replacement in living chondrichth- 
yans does not necessarily reflect the nature of the earliest jawed 
vertebrates. The extinct placoderms are the most primitive jawed ver- 
tebrates known, comprising either a monophyletic sister lineage to 
crown gnathostomes’ (Fig. 1a) or, more persuasively, a primitive grade 
of jawed vertebrates that includes a succession of sister lineages to 
crown gnathostomes** (Fig. 1b). As such, placoderms are crucial to 
resolving the early evolution of teeth and jaws. 

The nature of the dentition in placoderms has been the subject of 
debate that can be reduced to semantic differences, with some authors 
advocating a structural diagnosis that identifies teeth in placo- 
derms''”” and others adhering to the use of developmental criteria 
that preclude the presence of teeth in placoderms*~*. Intuitively, devel- 
opmental definitions cannot be applied to fossil material, but the 


pattern of skeletal development is preserved in the sclerochronology 
of growth-arrest lines and the polarity of cell lacunae and canaliculi in 
mineralised skeletal tissues. So far, however, understanding of placo- 
derm jaw and dental development has been limited largely to inference 
from surface morphology’, with only one direct study of development’”. 
This is because traditionally analyses have been destructive. We used 
SRXTM” to study jaw, dental and dermal skeletal development in onto- 
genetic stages of the arthrodire Compagopiscis croucheri, selected because 
it is known from abundant articulated three-dimensional remains; 
the smallest and largest specimens known were included in our study. 

The lower jaw of Compagopiscis is comprised of the infragnathal 
that rests on the Meckel’s cartilage which is ossified only at its proximal 
(articular) and distal (mentomeckelian) extremes (Supplementary Figs 
1 and 2). Tomographic data reveal that the infragnathal is composed 
of two principal ossifications, the bony shaft of the jaw (axial ossifica- 
tion) and a distal compound dental ossification (Fig. 2). We digitally 
extracted successive stages of growth in the infragnathal, revealing the 
sequence of development of the dental ossification (Figs 2d, e, g, h 
and 3). Growth proceeded through the addition of cusps, proximally, 
distally and lingually relative to a primordial cusp. Initial addition 
consists of single cusps and subsequent cusps are associated with the 
growth of an increasingly expansive sheet of tissue that in later devel- 
opmental stages extends ventrally around the bony shaft of the infra- 
gnathal and, in the largest infragnathal, partially around the Meckel’s 
cartilage (Figs 2 and 3). In most cases, these sheets of tissue are con- 
tinuous from the proximal to distal rows of cusps (Fig. 3), indicating 
coordinated growth. More cusps are added to the proximal row than to 
the distal and lingual rows during each growth episode. However, the 


Hagfishes b 


—_ 


Lampreys =_ 


Osteostracans 


SUUOPODE|4 


oS 


Ghondrichthyans 


the 


ord Acanthodians 
Acanthodians .- “ 

»_/ petinopterygians 
Sarcopterygians ROS 

Figure 1 | Evolutionary relationships of principal groups of vertebrates. 

a, b, Phylogenetic relationships among the principal groups of stem and crown 


gnathostomes. The traditional view of placoderm monophyly’’”* (a) versus the 
more recent hypothesis of placoderm paraphyly* (b). 
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Figure 2 | Lower-jaw element of Compagopiscis croucheri, Late Devonian 
period, Australia. a-c, microCT data; d-i, SRXTM data. Volume rendering of 
jaws and teeth (a-c, i) and surface cut (d-h). Ontogenetic sequence in lateral 
view (a) NHMUK PV P.50948, (b) NHMUK PV P.50943 and WAM 91.4.3 
(c). Teeth and jaw ossifications WAM 91.4.3: horizontal section (d, e), vertical 
section (f), longitudinal section (g, h) and labelled sclerochronology as virtual 
dental ossification (transparent) and bony shaft (shaded, i). Scale bar in 

a represents 2mm (a-c), 1 mm (d, h, i), 240 jum (f), 400 um (e, g). 


individual cusps are clearly successional (not synchronous) even 
within each growth episode and tooth row (Figs 2d-g and 3). The 
cusps are composed of dentine; they have distinct pulp cavities lined 
with centripetally nested tissue layers permeated by radially arranged 
and polarized canaliculi, but not cell lacunae (Figs 2e, g and 4a). The 
pulp cavity of each cusp is infilled progressively through ontogeny 
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Figure 3 | Virtual development of a Compagopiscis croucheri lower jaw, 
Late Devonian period, Australia. Labelled sclerochronology in the dental 
element of the lower jaw of WAM 91.4.3. Subsequent ontogenetic stages follow 
from top left to top right in an anti-clockwise direction. Darker shades 
represent the addition of developmental stages of teeth and sheets of tissue. 
Scale bar, 1 mm. 


such that the earliest formed cusps are completely infilled. The struc- 
ture and pattern of development of the supragnathals (Fig. 4a), which 
attach directly to the roof of the oral cavity, are comparable to the 
dental ossification of the infragnathal. 

The pattern of development of the dental ossification in Com- 
pagopiscis is compatible with observations made in other arthrodiran 
placoderms. For example, we can corroborate the identification of 
distinct dental and axial ossifications comprising the infragnathal of 
Plourdosteus, as well as the coordination of proximal and distal cusp 
development that was inferred but could not be observed in the same 
study’. However, we find no evidence for the widespread resorption 
and remodelling in Compagopiscis that was inferred for Plourdosteus'’. 
An axial and dental ossification, along with sequential cusp develop- 
ment in a proximal row, is present in the infragnathal of buchanosteids 
(Fig. 4b), which are distantly related arthrodires”, indicating that these 
characteristics are primitive features of arthrodires. Distinct gnathal 
and dermal jaw ossifications seem to be primitive for placoderms more 
generally, as a cusp-bearing capping structure occurs associated with a 
proximal axial ossification in rhenanids’*. An equivalent of the dermal 
axial ossification in arthrodires occurs also in antiarchs (Fig. 4c), in 
which the secondary absence of dentine cusps from the infragnathal 
mirrors their secondary absence from the dermal skeleton’®. However, 
it may alternatively reflect a primitive absence of teeth deep within the 
paraphyletic placoderm grade of stem-gnathostomes*”. 

The structure of the individual cusps and their pattern of sequential 
development within the gnathal elements in Compagopiscis and other 
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Figure 4 | Histological comparison of the Compagopiscis croucheri cusps 
on jaws, dermal bone and postbranchial lamina, with the jaws and pectoral 
fins of the antiarch Bothriolepis species, both Late Devonian, Australia, and 
the jaw of a buchanosteid arthrodire, Early Devonian, Saudi Arabia. 

a-g, Volume rendering of microtomography data (small insets; a, b), volume 
rendering of SRXTM data (small insets; c-f), surface-cuts showing SRXTM 
images (large insets; d, e, g) and virtual thin sections of SRXTM images 

(a-c, f). Cusps on jaws of Compagopiscis croucheri, proximal upper jaw (posterior 
supragnathal) National History Museum London (NHMUK) PV P.57629, small 


placoderms, support comparisons to gnathostome teeth that have been 
made previously based on surface morphology”""'. This interpretation 
has been contested on the grounds that tooth-like cusps are not dia- 
gnostic of teeth, evidenced by the fact that even in placoderms such as 
Compagopsicis, comparable tooth-like cusps also occur in association 
with the cranial dermal skeleton*”*. To test this comparison we exam- 
ined the structure and development of the dermal skeleton. The 
Compagopiscis dermal skeleton is similar to that of other placoderms 
that have been investigated'®, in that they are composed of a basal 
division of lamellar bone, a middle division of cancellar bone, and in 
the superficial division of compact bone with surface tubercles. 
However, unlike the morphogenetically distinct cusps associated with 
the gnathals, the superficial tubercles of the dermal skeleton are 
revealed to be focal developments of continuous sheets of bone that 
are morphogenetically integrated with the underlying dermoskeleton 
(Fig. 4d). Tooth-like structures associated with the dermal pectoral fin 
spines of antiarch placoderms could be an exception; when isolated, 
these structures can be mistaken for complete jaws® (Fig. 4e). However, 
our analysis reveals that the tooth-like margin of the pectoral spines 
is again characteristic of the dermoskeleton, comprised collectively 
of continuous sheets of bone, not from morphogenetically distinct 
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inset NHMUK PV P.50943 (a), buchanosteid arthrodire, lower jaw, Muséum 
National d’Histoire Naturelle Paris MNHN.F.ARB 239 (b) and Bothriolepis sp. 
lower jaw NHMUK PV P.50898 (c). Tubercles on dermal plate (marginal) of 
Compagopiscis croucheri NHMUK PV P.50945 (d), marginal cusps on pectoral 
fin of Bothriolepis sp. NHMUK PV P.50898 (e) and tooth-like cusps on 
postbranchial lamina of Compagopiscis croucheri NHMUK PV P.5255.6 

(f, g). Scale bar in g represents 500 ,1m (a, small inset f), 200 um (g, small 

inset a), 285 tum (b), 5 mm (small inset b), 167 tum (c-e), 2mm (small 

inset c), 400 {tm (small inset d), 333 [um (small inset e), 143 um (f). 


elements as in the gnathals (Figs 2 and 4a, b). Evidently, structural 
objections to the identification of teeth in placoderms are unfounded. 

It has been suggested that the placoderm dentition fails to meet the 
definition of a gnathostome tooth because there is no evidence that 
they develop from a deep and continuous dental lamina***. However, 
living jawed vertebrates show great diversity in dental development, 
with teeth developing in deep or shallow positions, from continuous to 
discrete epithelial pockets, that persist through life or atrophy and 
develop anew’’’; the plesiomorphic conditions for jawed vertebrates 
and crown gnathostomes are unclear. The discrete teeth in Com- 
pagopiscis and other placoderms developed in a shallow position like 
those of many living osteichthyans. The key distinction in placoderms 
is that the dentition is statodont, that is, teeth are not resorbed, shed 
and subsequently replaced. In this sense, the placoderm dentition is 
most similar to that of ischnacanthid acanthodians”, holocephalans”! 
and lungfish”, in which teeth develop through marginal apposition to 
a compound dental plate. The functional limitation of this approach to 
development is that worn teeth cannot be replaced. The innovation of 
crown gnathostomes is site-specific tooth replacement. 

It has been suggested that successional tooth homologues are 
present also on the postbranchial wall of placoderms, including 
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Compagopiscis®"°, coopted convergently to the jaw among different 
lineages of primitive gnathostomes to serve a tooth function”'®. The 
cusps are arranged into columns and rows reminiscent of the tooth 
families of extant chondrichthyans””, but the hypothesis of sequential 
development is an inference based on surface morphology. Our data 
refute this hypothesis, given that in the dermoskeleton, the rows of 
tooth-like cusps that occur on the postbranchial wall are simple focal 
developments of continuous sheets of spongy bone, added episodically 
to the growing margin of the postbranchial wall (Fig. 4f, g). 

Our evidence indicates that teeth are present even in the earliest 
jawed vertebrates and that within the phylogenetic context of placo- 
derm paraphyly*” they can be identified as homologous to the teeth of 
crown gnathostomes. This contrasts with the hypothesis that teeth 
were absent from the earliest jawed vertebrates, evolved convergently 
through cooption of oral and pharyngeal denticles’”’'. Indeed, our 
tomographic analyses show that the putative tooth-like pattern of 
placoderm pharyngeal denticle replacement bears no resemblance 
to that of their teeth, except in superficial morphology. Thus, the 
hypothesis of a distinct evolutionary origin of teeth and dermal 
denticles”"®** can be rejected, as jawless stem-gnathostomes have been 
shown to lack homologous dental patterning” and the assertion of 
a fundamental embryological distinction between external and oral 
denticles has been refuted*°. Ultimately, teeth and other oral and pha- 
ryngeal denticles must be derived through the extension of the odon- 
togenic capacity of the external dermis to the internal dermis and 
endoderm. However, tooth- and jaw-structure and development is 
evidently less integrated within the placoderm grade of early jawed- 
vertebrate evolution than in derived osteichthyans in which teeth, 
tooth development and jaw structure are intimately interwoven, as 
part of the process of site-specific tooth replacement. Upper and lower 
dental ossifications occur in placoderms, but there is no clear homo- 
logue of the osteichthyan dentary or coronoid. However, the axial 
ossification of the infragnathal can be compared to the inner dental 
arcade of early osteichthyans based on its location relative to the 
underlying Meckel’s cartilage, overlying dental ossification and lateral 
muscle attachment. Compagopiscis and other placoderms evidence an 
early stage in jawed vertebrate evolution in which the components of 
the mandible were fewer and more obviously distinct than they are in 
osteichthyan evolutionary and developmental model organisms. Some 
processes associated with these more derived taxa, such as tooth 
resorption (as a necessary precursor to tooth replacement), are absent 
in placoderms. This stepwise acquisition reflects the fact that character 
complexes like the gnathostome jaw have been assembled over a pro- 
tracted episode of evolutionary history and so the modular construc- 
tion of the mandible’, for example, reflects the disparate evolutionary 
origins of its component modules. 


METHODS SUMMARY 

Museum repositories: National History Museum London (NHMUK); Western 
Australian Museum (WAM); Muséum National d'Histoire Naturelle Paris 
(MNHN). The material was scanned using a SkyScan 1172 Micro-CT scanner 
at the University of Bristol, and using the TOMCAT beamline” of the Swiss Light 
Source (SLS), Paul Scherrer Institut, Switzerland. Slice data were analysed and 
manipulated using Avizo 6.3 (http://www.vsg3d.com). 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 


The fossil material used in this study is housed in the National History Museum 
London, the Western Australian Museum (WAM) and the Muséum Mational 
Histoire Naturelle Paris (MNHN). It is comprised of specimens of 
Compagopiscis croucheri and Bothriolepis species from the Frasnian, Late- 
Devonian-period Gogo Formation of Western Australia and a buchanosteid 
arthrodire from the Emsian, Early Devonian, Jawf Formation of Saudi Arabia. 
The material is acid prepared and was scanned using the SkyScan 1172 micro- 
tomography scanner at the University of Bristol (Figs 2a—c, and 4a, b; small insets) 
and synchrotron radiation X-ray tomographic microscopy (SRXTM)”’, at the 
TOMCAT beamline” of the Swiss Light Source (SLS), Paul Scherrer Institut, 
Switzerland (Figs 2d-i, 3 and 4). Using a X10 objective exposure time at 25 keV 
was 1100 ms, and 3,001 projections were acquired equi-angularly over 360° with 
the rotation axis positioned at the border of the field of view. In this way it has been 
possible to almost double the width of the field of view offered by the X 10 objective 
while preserving spatial resolution (Figs 2d-i and 3). Projections were post-pro- 
cessed and rearranged into flat- and dark-field-corrected sinograms, and recon- 
struction was performed on a 60-core Linux PC farm, using a highly optimized 
routine based on the Fourier transform method and a regridding procedure”. The 
resulting volume, obtained by vertically stacking four tomograms, consisted of 
6,645 X 3,082 X 3,659 voxels, with isotropic dimensions of 0.74 |tm. The other 


SRXTM experiments followed a standard acquisition approach with the rotation 
axis located in the middle of the field of view and the acquisition of 1,501 projec- 
tions equi-angularly distributed over 180°. For the larger specimens, a 4 object- 
ive (resulting pixel size = 1.85 jum) and an energy of 20 keV (Fig. 4a, c, e) or 30 keV 
(Fig. 4f, g) were used. Smaller samples were scanned using a X 10 objective (result- 
ing pixel size = 0.74 um) at either 20 keV (Fig. 4c), 21.5 keV (Fig. 4d) or 30 keV 
(Fig. 4b). The microtomography experiments of the largest complete specimens 
were 360° scans with resulting pixel size of 5 pm at 37 kV (Fig. 2b, cand small inlets 
of Fig. 4a), 65kV (Fig. 2a) and 75 kV (small inlet of Fig. 4b). Slice data were 
analysed and manipulated using Avizo 6.3 (http://www.vsg3d.com). Sectional 
images and ‘virtual thin sections’ were created using maximum intensity projec- 
tions (MIP; the voltex module in Avizo), which simulates the casting of light rays 
from preset sources through a volume of data. Three-dimensional models of the 
different growth stages were derived by labelling manually the sclerochronology as 
slightly different grey-scale volumes. 
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Shifts in rainfall patterns and increasing temperatures associated 
with climate change are likely to cause widespread forest decline in 
regions where droughts are predicted to increase in duration and 
severity’. One primary cause of productivity loss and plant morta- 
lity during drought is hydraulic failure’ *. Drought stress creates 
trapped gas emboli in the water transport system, which reduces 
the ability of plants to supply water to leaves for photosynthetic gas 
exchange and can ultimately result in desiccation and mortality. At 
present we lack a clear picture of how thresholds to hydraulic 
failure vary across a broad range of species and environments, 
despite many individual experiments. Here we draw together pub- 
lished and unpublished data on the vulnerability of the transport 
system to drought-induced embolism for a large number of woody 
species, with a view to examining the likely consequences of climate 
change for forest biomes. We show that 70% of 226 forest species 
from 81 sites worldwide operate with narrow (<1 megapascal) 
hydraulic safety margins against injurious levels of drought stress 
and therefore potentially face long-term reductions in productivity 
and survival if temperature and aridity increase as predicted for 
many regions across the globe**. Safety margins are largely inde- 
pendent of mean annual precipitation, showing that there is global 
convergence in the vulnerability of forests to drought, with all 
forest biomes equally vulnerable to hydraulic failure regardless 
of their current rainfall environment. These findings provide 
insight into why drought-induced forest decline is occurring not 
only in arid regions but also in wet forests not normally considered 
at drought risk”*. 

Sensitivity to drought is fundamentally important in shaping the 
geographic distribution of individual species as well as communities””®. 
Drought has underpinned many large-scale forest mortality events 
over the past century, often in combination with other abiotic and 
biotic factors’"’. Recent evidence suggests rising global temperatures 
are already amplifying drought-induced forest change and affecting 
terrestrial net primary productivity'* '*. The consequences of longer 
droughts and higher temperatures are potentially dramatic. For example, 
rapid forest collapse as a result of drought could convert the world’s 
tropical forests from a net carbon sink into a large carbon source during 
this century*"’. Predicting how forests will respond to future climate 


changes hinges on a quantitative understanding of the physiological 
mechanisms governing drought stress at the species level. One of the 
most promising avenues for characterizing the sensitivity of plants to 
drought stress is by quantifying the strength of the liquid (hydraulic) 
connection between soil and leaves through the water-transporting 
xylem tissue. 

Cavitation, a phase change from liquid water to vapour, occurs in 
plants because water transported through the xylem is under negative 
pressure’®. The resultant air emboli block xylem conduits and reduce 
the plant’s ability to move water from soil to sites of photosynthesis”. 
Recent evidence indicates that the ability of woody plants to survive 
and recover from periods of sustained drought is strongly related 
to their embolism resistance**. This property varies widely among 
species and is largely determined by differences in the structure of the 
xylem’*°. Although xylem structure can acclimate to environmental 
variation during growth and development, subsequent acclimation of 
embolism resistance to environmental stress is not possible because 
xylem conduits are dead at maturity. Embolism resistance therefore 
represents a critically important trait for defining the limits of drought 
tolerance across woody species and predicting drought-induced forest 
decline at regional and global scales. 

In drying soil, stomata initially regulate water loss from the leaves to 
maintain xylem pressure (Y%,; measured as water potential below 0) 
within a range that will protect the xylem from extensive embolism’. 
As drought continues, stomatal closure slows but does not halt the 
decline of xylem pressure and hydraulic capacity. If soil water is not 
replenished before complete hydraulic failure occurs then the plant 
will desiccate and potentially die. The resistance ofa plant to embolism 
is described by the relationship between xylem pressure and loss of 
hydraulic conductivity due to conduit occlusion by gas emboli (Sup- 
plementary Fig. 1). The Y, at which 50% loss of conductivity occurs 
(Wo) is the most commonly used index of embolism resistance. 
When Y, falls below Yo the water transport function of the xylem 
is markedly impaired and the plant is exposed to considerable risk of 
accelerated embolism leading to long-term reductions in productivity, 
tissue damage, and ultimately death”. To examine vulnerability of forest 
biomes to drought-induced hydraulic failure we assembled a database 
of 59 (Supplementary Table 1, including 480 woody species). Site 
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Figure 1 | Minimum xylem pressure as a function of embolism resistance 
for 191 angiosperm and 32 gymnosperm species. The safety margin is the 
distance between each point and the 1:1 (dashed) line. Data were binned in 1.0- 
MPa increments for embolism resistance (50). Bins were pooled with the next 
lowest bin if they contained only one sample. Raw data are shown as smaller 
points behind binned data. Error bars, s.d. Regression lines shown were fitted to 
raw data (angiosperms, r= 0.57, P< 0.0001; gymnosperms, r= 0.59, 
P<0.0001; data set available in Supplementary Table 1). Minin, minimum 
xylem pressure. 


climate varied widely, for example, mean annual precipitation (MAP) 
ranged from 300 to 4,500 mm, and mean annual temperature from —4 
to 27°C. Data for angiosperms (flowering plants) and gymnosperms 
(mainly conifers) were analysed separately because of fundamental 
differences in xylem structure between these two groups. 

We observed a significant (P < 0.0001) linear relationship between 
the minimum Y, measured in plants under natural conditions (Ynin) 
and Ys for both angiosperms and gymnosperms, showing that embo- 
lism resistance is tightly linked with the level of drought stress experi- 
enced by plants across a broad range of environments (Fig. 1). The 
difference between Yi, and 59 represents a highly informative ‘safety 
margin’ within which the plant operates in a given environment”***. This 
safety margin quantifies the degree of conservatism in a plant’s hydraulic 
strategy, indicating that plants with low (or even negative) safety margins 
experience large amounts of embolism and therefore potential risk of 
hydraulic failure (Supplementary Fig. 1). Measurements of Yinin Were 
available for 226 of the 480 species included in our database. 

Across all forest biomes, 70% of all species operated at narrow 
(<1 megapascal (MPa)) safety margins (Fig. 2a), indicating that both 
arid and mesic biomes are vulnerable to drought-induced decline if 
extreme drought events become more frequent as predicted under 
global climate change*®. Applying more conservative safety margins, 
for example, the difference between Yi, and almost complete xylem 
failure (gg), showed a similar convergence of vulnerability across 
forest biomes (Fig. 2b). 

Overall, gymnosperms showed greater hydraulic safety margins 
than angiosperms, with 42% of all angiosperms versus only 6% of all 
gymnosperms operating at negative safety margins, that is, Yimin below 
Yo (Figs 1, 2). The seemingly risky embolism-tolerance strategies seen 
in angiosperms suggest that flowering plants may have a greater capa- 
city to reverse embolism, a process by which gas is dissolved and the 
conduits are restored to a water-filled and functional status. Although 
this process is still poorly understood, it is clear that recovery can only 
occur if periods of drought are followed by sufficient precipitation and 
a return to favourable water status*”’. Therefore, refilling does not 
represent an effective escape strategy for mitigating the effects of severe 
and persistent drought. 


LETTER 


a 10 


A 
e [235 Angiosperms 
8 A Gag Gymnosperms 
e 
6 L 
e a 
a e 
= at . b 
32 3 e b ‘ 
= 2+ g 3 e 
2) 
g Tt 4 rs 
a 0 
> 
e t z | v 
O ob t 
-4} 8 ‘ neeneer 
Increasing mean annual precipitation 
-6 ii T T T 
b 12 
A 
a 
1ok ° 
A 
a, BF 
g $ 
= 6 3 ‘ ° 
8 + ° 
x ® > a 
S 4k e 
il 
> 2r 
o) 
5 0 T r- ¥. a 
e 
“27 - Increasing mean annual precipitation 
-4 : r 1 1 
Mediterranean Temperate Tropical Tropical 
and woodland forest seasonal forest __rain forest 


Figure 2 | Box plot of hydraulic safety margins for angiosperm and 
gymnosperm species across forest biomes. a, b, The so (Yimin — W50) safety 
margin is shown in a (n = 223), and the gg (Wmin — Ygg) safety margin is 
shown in b (n = 222). Boxes show the median, 25th and 75th percentiles, error 
bars show 10th and 90th percentiles, and filled symbols show outliers. 
Gymnosperm species were not represented in tropical forests. Significant 
differences (P< 0.05) between biome means are indicated by letters above 
boxes with angiosperms (lowercase a, b) and gymnosperms (uppercase A) 
considered separately. Data set available in Supplementary Table 1. 


Wider safety margins in gymnosperms than angiosperms do not 
mean that gymnosperms are immune to the threat of hydraulic failure. 
In fact, Pinaceae species have significantly lower embolism resistance 
and safety margins than Cupressaceae’’”®”®, which is reflected in the 
greater frequency of dieback events involving Pinaceae’'’*. In 
the severe 2002-2003 drought, for example, Pinus (Pinaceae) species 
suffered widespread mortality in the south western United States, 
whereas co-existing Juniperus (Cupressaceae) survived’? 

In our data set, 5 was strongly associated with MAP (Fig. 3) such 
that both the mean and upper tenth quantile trends showed decreasing 
resistance to embolism (less negative Y59) with increasing rainfall 
(P < 0.05). Similar relationships were found between Ys9 and climate 
variables that account for both the variation in potential evapotrans- 
piration (PET) and seasonality of precipitation: aridity index (MAP 
divided by PET) and mean precipitation of the driest quarter (Sup- 
plementary Fig. 2). However, a wide range of hydraulic strategies occur 
within any given climate region, with the greatest variation in V5 
occurring at sites with a MAP of between 300 and 1,000 mm. In high 
MAP sites, represented by tropical rainforests in our data set, variation 
is compressed to less negative Yso, suggesting that low embolism res- 
istance is associated with high transport efficiency and low structural 
‘costs’, making this an advantageous strategy in highly productive, wet 
tropical environments’®. 
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Figure 3 | Embolism resistance as a function of mean annual precipitation 
for 384 angiosperm and 96 gymnosperm species. Each point represents one 
species. A generalized model indicated that embolism resistance (Y%59) was 
significantly related (P< 0.00001) to mean annual precipitation (MAP) for 
angiosperms and gymnosperms (see Methods for details), with decreasing 
resistance to embolism corresponding to increasing rainfall. The full data set is 
available in Supplementary Table 1. 


It is clear from Fig. 3 that 5) and MAP are decoupled in certain 
cases, implying that some species growing in drier environments 
escape from water stress, therefore alleviating the need for high embo- 
lism resistance. There are many examples in the literature of species 
with low embolism resistance growing in areas of low rainfall or sea- 
sonal drought. This includes riparian and ground-water-dependent 
vegetation’, and drought-deciduous trees in tropical dry forests”. 
These species avoid very negative Y%, by some combination of pre- 
dictable access to ground water (deep roots), internal water storage and 
reduced leaf area or other shifts in biomass allocation’®*’”*. Although 
these adjustments decouple 59 and Ynin from MAP, it seems that the 
majority of species operate close to their functional limits. They 
are therefore exposed to xylem failure during anomalously low rainfall 
in a manner that is largely independent of rainfall region and biome 
(Fig. 2). 

The convergence on a ‘risky’ hydraulic strategy exhibited by many 
species can be understood as the result of a trade-off that balances growth 
with protection against risk of mortality in a given environment’’*!”. 
Thus, a low safety margin to 50 also indicates that stomatal regulation 
takes full advantage of the range of xylem pressures that are within the 
tolerance of the hydraulic system of that species. This stomatal beha- 
viour carries with it the benefit of increased carbon gain but may come 
at the cost of extensive loss of photosynthetic area or death. 

A fundamental question concerns the plasticity and genetic diver- 
sity of embolism resistance within species*””°. If the tight link 
between embolism resistance and water availability is the product of 
natural selection over many generations and adaptation is limited by a 
long generation cycle of perennial plants, then the rapid pace of climate 
change may outstrip the capacity of populations to adapt. This could 
lead to long-term reductions in net primary productivity of forest 
systems, loss of biodiversity and changes to the composition of forest 
and woodland communities. Although it is evident that multiple mecha- 
nisms (hydraulic failure, carbohydrate depletion and insect attack) are 
involved in drought-induced mortality, these mechanisms are highly 
interdependent'’’*. Embolism formation is a key mechanism of vege- 
tation shifts and forest decline because it sets the thresholds for 
stomatal closure, leading to limitations on photosynthesis, increased 
heat and light damage, and a run-down of carbohydrate reserves over 
time. Our findings demonstrate the necessity of integrating long-term 
monitoring of Ynin With measurements of embolism resistance and 
safety margins. The inclusion of these data in process-based vegetation 
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models will improve the accuracy with which the responses of forest 
ecosystems to climate change can be predicted. 


METHODS SUMMARY 


Xylem traits were compiled from published and unpublished sources (Supplemen- 
tary Table 1). When values of 59 and ¥gg were not reported in numerical form, 
they were extracted from published graphs of vulnerability curves. Yyin data are 
the minimum midday water potential recorded for each species in the field, indi- 
cating a seasonal, rather than daily minimum value. Two types of safety margins 
were calculated: the %5o margin (Wmin — Yso) and the Ygg margin (Yimin — Ys) 
(Supplementary Fig. 1). 

Data in Fig. 1 were binned in 1.0-MPa increments of 59. Bins were pooled with 
the next lowest bin if they contained only one sample. Regression lines in Fig. 1 
presented are for raw data to avoid bias associated with uneven bin size. 
Differences in biome means for safety margin were analysed using a general linear 
model, with differences between angiosperms and gymnosperms considered sepa- 
rately. Climate data were taken from the paper in which Ys, data were published, 
or from the WorldClim database or CRU climate database, whichever gave an 
elevation closest to that given in the paper. Relationships between Y5o and climate 
variables (MAP, aridity index, mean precipitation of the driest quarter) were 
analysed using a generalized model assessed by restricted maximum likelihood 
in which variance was simultaneously modelled as a power function of the same 
climate variables. Quantile regression was also used to assess the boundary rela- 
tionship between 5, and climate variables. As so is a negative variable, the 10% 
quantile was used, which is equivalent to the 90% quantile for a positive response 
variable. 

Forest biomes were assigned based on site descriptions contained in the primary 
sources. We defined ‘forest’ broadly to include Mediterranean, savanna and wood- 
land environments that are not commonly classified as forests. The data set there- 
fore encompasses tree, shrub and liana species from vegetation communities with 
a significant component of woody plants. 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 


The resistance of a species to embolism is described by a vulnerability curve, which 
shows the percentage loss of hydraulic conductivity (PLC, %) as a function of 
decreasing xylem pressure (%,, MPa); that is, as plant drought stress increases 
(Supplementary Fig. 1). We define three parameters on this curve: first, the xylem 
pressure at which 50% of conductivity is lost (Y%s9, MPa); second, the xylem pres- 
sure at which 88% of conductivity is lost (gg, MPa), which represents the upper 
inflection of the curve; and third, the slope of the curve between ¥%s9 and Vgg. 

The most negative Y, observed for a species is defined as Y nin (MPa). ‘Lethal’ 
Y nin Values reported in manipulative trials were not included in the database. If 
Y nin is equal to Y5o, this means that the plant has lost 50% of its hydraulic capacity. 
Ws is the most commonly used index of xylem embolism resistance in literature 
and represents the steepest part of the vulnerability curve, meaning that even a 
slight drop in Y, will result in a substantial reduction in hydraulic function. 
However, 59 does not represent the mortality point per se because the ¥, value 
corresponding to lethal levels of embolism may vary among species, depending on 
the water-use strategy of the whole plant. In our data set, 42% of the angiosperm 
species (versus 6% of the gymnosperm species) show Yinin Values below Yso. 

A hydraulic safety margin was defined as the difference between Ynin and Ys5o 
(Supplementary Fig. 1 and Fig. 2a). An alternative, more conservative safety margin 
was defined as the difference between Yimin and Wg (Fig. 2b). Both the 59 and the 
Ws safety margins should be interpreted as relative vulnerability indices with a low 
(or even negative) value indicating a higher-risk strategy for hydraulic failure com- 
pared to a high safety margin. 

The data set was compiled from published papers and unpublished results of the 
authors (see Supplementary Table 1), including 480 species from 172 sites, with 384 
angiosperm and 96 gymnosperm species. Genera and species names were checked 
for orthography and synonymy using TaxonScrubber (version 2.1; www.salvias. 
net/pages/taxonscrubber.html), the International Plant Names Index (www. 
ipniorg), Tropicos (www.tropicos.org), Phylomatic (www.phylodiversity.net/ 
phylomatic/) and the Angiosperm Phylogeny Website (http://www.mobot.org/ 
MOBOT/research/APweb/). Samples based on measurements of roots, petioles 
or trunks were removed for the analyses. Data presented for %s9 and Ygg were 
from distal branches (approximately 0.5-1.2 cm in diameter) of 1- to 3-year-old 
stems and therefore represent a conservative estimate of embolism resistance, as 
roots and leaves are normally less embolism resistant than stems within a species. 
All but 2% of the samples are from mature trees with 2% of samples coming from 
saplings or seedlings. PET data were extracted from the Global Aridity Index 


(Global-Aridity) and the Global Potential Evapo-Transpiration (Global-PET) 
Geospatial Database (http://www.cgiar-csi.org/2010/04/134/). 

The data set was then filtered for the target parameters. The data set was first 
filtered for samples in which both Y%59 and Yimin were available, and filtered 
separately for Y%59 and MAP, which accounted for the majority of samples initially 
included. Only gg data for Yimin filtered species were included. Relationships 
between Ys, and climate variables (MAP, aridity index, mean precipitation of the 
driest quarter) were analysed using a generalized model assessed by restricted 
maximum likelihood in which variance was simultaneously modelled as a power 
function of the same climate variables. Quantile regression was also used to assess 
the boundary relationship between Y59 and climate variables. As 50 is a negative 
variable, the 10% quantile was used, which is equivalent to the 90% quantile for a 
positive response variable. 

Wsq and Yinin were collected from the same population of plants with the 
exception of a few cases in which nin data were extracted from another study 
or from unpublished sources. In 67% of our samples Yin Was measured as xylem 
or stem water potential, that is, the leaves were covered with plastic and aluminium 
foil such that leaf and stem water potentials were equilibrated. In the remaining 33% 
of cases Y,,i, was measured as leaf water potential. In this case, Y% may be less 
negative than leaf water potential because of the pressure drop across the leaf 
hydraulic pathway caused by transpiration. Because of the difference in leaf 
and xylem water potential it is possible the amount of embolism in the stem could 
be overestimated, that is, the safety margin would appear narrower than it actually 
was. 

However, there are two main reasons why this issue would not alter the results. 
First, Ynin has most probably been underestimated in most cases because we lack 
continuous long-term data sets of water potential. Second, 48% of the species for 
which leaf water potential was recorded were below —2.0 MPa. At this point the 
stomata are assumed to be closed in isohydric species (that is, species that close their 
stomata to prevent ¥, from dropping) and thus leaf water potential would be close 
to equilibrium with stem water potential. To estimate the magnitude of any poten- 
tial error, we correlated Ynin pre-dawn and Yimin midday for studies in which both 
were available. Although there is equilibrium for Yimin pre-dawn between leaves 
and stems, in theory there should be a more significant drop in Ynin midday for 
leaves than for stems. However, the slope of the relationship between Yin midday 
and Yin pre-dawn was statistically indistinguishable for leaves and stems, indi- 
cating that no bias was introduced by combining leaf and stem Y,in data (analysis 
of covariance (ANCOVA) interaction term, F;,;12 = 0.944, P = 0.333). 
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The genomic landscape of species divergence in 


Ficedula flycatchers 
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Unravelling the genomic landscape of divergence between lineages 
is key to understanding speciation’. The naturally hybridizing col- 
lared flycatcher and pied flycatcher are important avian speciation 
models*”’ that show pre- as well as postzygotic isolation®’. We 
sequenced and assembled the 1.1-Gb flycatcher genome, physically 
mapped the assembly to chromosomes using a low-density linkage 
map”? and re-sequenced population samples of each species. Here 
we show that the genomic landscape of species differentiation is 
highly heterogeneous with approximately 50 “divergence islands’ 
showing up to 50-fold higher sequence divergence than the geno- 
mic background. These non-randomly distributed islands, with 
between one and three regions of elevated divergence per chro- 
mosome irrespective of chromosome size, are characterized by 
reduced levels of nucleotide diversity, skewed allele-frequency 
spectra, elevated levels of linkage disequilibrium and reduced pro- 
portions of shared polymorphisms in both species, indicative of 
parallel episodes of selection. Proximity of divergence peaks 
to genomic regions resistant to sequence assembly, potentially 
including centromeres and telomeres, indicate that complex repeat 
structures may drive species divergence. A much higher back- 
ground level of species divergence of the Z chromosome, and 
a lower proportion of shared polymorphisms, indicate that sex 
chromosomes and autosomes are at different stages of speciation. 
This study provides a roadmap to the emerging field of speciation 
genomics. 

As lineages diverge, a combination of pre- as well as postzygotic 
reproductive isolation barriers will eventually arise’. Divergence is 
likely to start from specific loci that may precede and cause the evolu- 
tion of reproductive incompatibility. Hybridization between diverging 
lineages may therefore create a genomic mosaic of regions where 
interspecific gene flow occurs at different rates (the genic view of 
speciation"), with introgression expected to be weak in genomic 
regions involved in speciation. Revealing the genomic regions with 
elevated levels of divergence will eventually deepen our knowledge 
of the speciation process. However, more than 150 years after the 
publication of On the Origin of Species’’, the genetic basis of speciation 
is still largely unresolved’*"*. We know little about the identity, num- 
ber and effect size of loci involved in population divergence, their 
genomic distribution and the type of mutations involved. Advances 
in sequencing technology now open a promising avenue for the study 
of genomic divergence, even for non-model vertebrate species with 
gigabase (Gb)-sized genomes. 

The collared flycatcher Ficedula albicollis and the pied flycatcher 
Ficedula hypoleuca (Fig. 1) are important study organisms for key 
aspects of evolutionary ecology and biology*’. Diverged less than 
2 million years ago, their history has been shaped by repeated cycles 
of glaciation in Eurasia where periods of allopatric divergence in 
refugia probably alternated with periods of secondary contact during 
which gene flow and selection were vital components; they still 


hybridize in areas of sympatry (Supplementary Figure 2). To study 
the genetic basis of species divergence in this system, we sequenced and 
assembled the flycatcher genome, and physically placed, ordered and 
oriented sequence scaffolds along chromosomes through linkage-map 
data. This was followed by re-sequencing of genomes and transcrip- 
tomes of population samples of both species (Supplementary Methods, 
Supplementary Fig. 1), allowing base-pair (bp)-resolution of the pattern 
of differentiation on a genomic level and providing a roadmap for 
studies in the emerging field of speciation genomics. 

The final assembly encompassed 1.13 Gb with an N50 scaffold size 
of 7.3 Mb and with 89% of the assembly contained within no more 
than 200 scaffolds larger than 1Mb (Supplementary Tables 1-7). 
The sequenced bird was heterozygous at 3.66 million positions, cor- 
responding to an average of one segregating site every 330 bp. A low- 
density linkage map of collared flycatcher'® anchors 73% of the assembly 
to chromosomes and orients scaffolds along them (Supplementary 
Fig. 3). Based on conserved chromosomal organization between 
flycatcher and zebra finch (Supplementary Fig. 4) we were able to 
anchor and orient additional scaffolds, thereby physically positioning 
1.00 Gb of the assembly (89%) in the genome (Supplementary Fig. 5). 
This illustrates that physical assembly of Gb-sized genomes sequenced 
with short reads is possible with modest linkage information and when 
assisted with genome information from a related species. The fly- 
catcher genome contained 18,735 predicted protein-coding genes, of 
which 18,649 (>99.5%) were identified as expressed based on RNA- 
seq data from a variety of tissues. 

We then sequenced the genomes of 10 unrelated males of each 
species (mean coverage 5.69x + 2.01s.d.; Supplementary Table 8) 
and found 9.86- and 7.13-million segregating sites in collared flycatch- 
ers and pied flycatchers, respectively. The fact that 3.81 million of these 
single nucleotide polymorphisms (SNPs) (53.4% and 38.6% of the total 
in each species, respectively) were shared between species confirms 
their close genetic relationship and provides an unusual access to 
genomic data of two species before complete lineage sorting. The mean 
pairwise nucleotide difference in interspecific comparisons (d,,) for 


Figure 1 | Study species. a, Male collared flycatcher. b, Male pied flycatcher. 
Note that the male collared flycatcher has a white neck collar and a more 
pronounced white forehead and wing patches. Photographs courtesy of Johan 
Traff. 
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50-kb windows was 0.0046 + 0.0011, which was only marginally 
higher than the mean pairwise nucleotide diversity (z) in intraspecific 
comparisons (7¢o1: 0.0036 = 0.0010; tpieg: 0.0021 + 0.00076); indivi- 
duals of the two species are thus genetically not much more different 
from each other than are individuals within species. Acknowledging 
that population samples of 10 individuals from each species provide 
low power for detecting rare alleles, an indication of species divergence 
can be obtained by noting at how many sites all collared flycatchers 
were homozygous for one allele and all pied flycatchers homozygous 
for another (which we refer to as sites of fixed differences, d;). We 
found 239,745 such sites, which corresponds to 1 fixed difference every 
4.7 kb. Of these, 1,513 sites were located within protein-coding regions 
and it is realistic that a proportion of these positions represent the 
genetic basis for key phenotypic differences between the two species. 

The genomic landscape of species divergence was highly hetero- 
geneous, with a fraction of windows showing highly elevated diver- 
gence up to 50 times higher than the genomic median (0.00013) and 
mean (0.00022) (Supplementary Fig. 6). The distribution of autoso- 
mal windows with elevated divergence showed a non-random pattern 
with approximately 50 well-defined clusters of high Fey (the fixation 
index, a measure of population differentiation) and d; (‘divergence 
peaks’ or ‘genomic islands of divergence’) (Fig. 2a and Table 1). 
Average peak size was in the range of several hundred kb (median, 
400 kb; mean, 625 kb; range, ~ 100 kb to 3 Mb; Supplementary Fig. 7) 
and, in total, divergence peaks covered 2.7% of the genome yet con- 
taining 25% of all fixed differences. An immediate feature of the 
distribution of these 50 peaks was that they were non-randomly dis- 
tributed across the genome (Kolmogorov-Smirnov test, D = 0.4516, 
P= 0.0002), irrespective of chromosome size and despite substantial 
heterogeneity in chromosome size, there were in most cases one to 
three peaks per chromosome (Fig. 2a). Moreover, peaks were highly 
overrepresented in the very end of chromosomes and six microchro- 
mosomes had peaks in both ends. Another feature of divergence islands 
was that they lie at the end of scaffolds, thereby not forming a con- 
tinuous and symmetric signal in the assembly. As a consequence, peaks 
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Table 1 | Mean values of population genomic parameters 


Parameter Genomic background Islands of divergence Extreme per peak 
dt 0.00027 0.00171 0.00281 
Fst 0.357 0.742 0.856 
Tepied 0.00219 0.00067 0.00065 
Toll 0.00370 0.00132 0.00030 
pied 0.376 0.129 —0.428 
coll 0.221 0.053 —0.466 
ried 0.059 0.082 0.133 
r coll 0.065 0.088 0.111 


Data are from autosomal 50-kb windows divided into the genomic background and genomic islands of 
divergence (windows with the density of fixed differences, d; > 0.001). D, Tajima’s D, 7, an estimate of 
linkage disequilibrium. ‘Extreme per peak’ represents the mean of the highest or lowest value per peak. 
Differences between all parameter estimates in divergence islands versus genomic background are 
statistically significant at P< 2.2 x 10° 1° (Wilcoxon test). 


in the end of chromosomes were generally ‘one-tailed’, whereas peaks 
within the interior of chromosomes were usually formed by adjacent 
scaffolds with maximum divergence juxtaposed to the assembly gap 
between scaffolds (Fig. 2b and Supplementary Fig. 8). This peculiar 
pattern raises the issue of whether peaks are artefacts associated with 
the scaffolding process or read mapping. However, several observa- 
tions convincingly argue against this (Supplementary Notes). 

If selection has driven population divergence in regions of high 
differentiation, we might expect to see reduced levels of within-species 
diversity in these regions in one of the species. Species-specific esti- 
mates of 2 showed that this was essentially always the case (Fig. 2b, 
Supplementary Fig. 8), with mean z in divergence islands less than 
one-third of the genomic background level (Table 1). As Fsr by its 
nature is a function of within-species diversity’, this association might 
be trivial. However, because the vast majority of divergence islands 
were seen with Fs; as well as d; regions of high divergence were 
characterized both by a high frequency of sequence differences 
between the two species and a low frequency of sequence differences 
within species. A noteworthy consequence of these coinciding features 
was that d,, did not exceed background levels in divergence islands 
(Fig. 2b and Supplementary Fig. 8). Further indication of selection in 
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Figure 2 | The genomic landscape of species divergence in flycatchers. 

a, Distribution of divergence measured as the density of fixed differences per bp 
for 200-kb windows across the genome. Chromosomes are listed in numerical 
order and are separated by gaps. Red horizontal bars show the approximate 
location of centromeres in homologous chromosomes of zebra finch. Open 
read symbols are used to indicate that avian microchromosomes are generally 
acro- or telocentric; both ends of these chromosomes are labelled as the 
orientation is not known. For chromosomes 4, 6 and 8, there is a lack of an in 
situ mapped marker 5’ of the centromere in zebra finch. b, Distribution of 
population genomic parameters along an example chromosome (chromosome 
4A). The plots show the density of fixed differences per bp (d;) (yellow), Fst 
(red), the total between-species sequence divergence (dy), nucleotide diversity 


(x) for each species, the proportion of shared polymorphisms among sites 
polymorphic in at least one species (purple), the proportion of private 
polymorphisms among sites polymorphic within species (private and shared 
polymorphisms shown in the same panel), Tajima’s D, and linkage 
disequilibrium (77). For 2, private polymorphisms, D and 1’, species-specific 
estimates are given for collared flycatcher in blue and for pied flycatcher in 
green. Assigned scaffolds are shown under the plot: black, denoting scaffolds 
ordered and oriented by the collared flycatcher linkage map; grey, scaffolds 
ordered with the collared flycatcher linkage map and oriented through 
comparative mapping with zebra finch; white, scaffolds ordered and oriented 
through comparative mapping with zebra finch. 
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regions of high divergence was given by the observations of allele- 
frequency spectra being skewed towards rare alleles and strong signals 
of linkage disequilibrium (Table 1, Fig. 2b and Supplementary Notes). 

How the abovementioned signs of selection are distributed between 
the two species is relevant for interpretation of the heterogeneous 
genomic landscape of species divergence. For almost all regions of 
elevated divergence, both species showed reduced nucleotide diversity 
(Fig. 2b and Supplementary Fig. 8). This cannot be explained by a loss 
of diversity in the ancestral population because it would not lead to the 
observed high incidence of fixed differences. Moreover, taking dg (the 
synonymous substitution rate) as a proxy for mutation rate, we found 
that regions of high divergence were not low in variability because of 
low mutation rate (generalized linear model, including chromosome 
length, peak versus non-peak; z = 0.598, P = 0.550). The association of 
high divergence with low diversity speaks further against mapping 
artefacts, which can increase divergence between species but should 
also lead to elevated diversity within species. Taken together, these 
results suggest that selection has acted to reduce genetic variability 
in the very same regions in the two lineages independently. 

Genomic regions with reduced levels of interspecific recombination 
are hindered from gene flow, facilitating the build-up of reproductive 
incompatibilities. Moreover, divergent selection may enhance differ- 
entiation over larger genomic regions when the intraspecific recom- 
bination rate is low (divergence hitchhiking’*’’). To assess the 
relationship between recombination and divergence, we estimated 
the population recombination rate (~, which equals N.r, where N, is 
the effective population size and r is the per-generation recombination 
rate), p/m (as reduced diversity, N,, within divergence islands will lower 
the estimates of p and contribute to differences in p between diver- 
gence peaks and the genomic background even if r would be similar) 
and used genetic distances from the collared flycatcher linkage 
map related to the physical distance between markers according 
to the genome assembly to assess the relationship between recombina- 
tion and divergence (Supplementary Methods and Supplementary 
Table 9). These tests provided no strong evidence of reduced recom- 
bination rate in the proximity of divergence islands (Supplementary 
Notes). 

The flycatcher karyotype has not been established, thus the location 
of centromeres is not known. However, avian microchromosomes are 
generally acro- or telocentric’’. An attempt to approximate the loca- 
tion of centromeres on flycatcher macrochromosomes was made using 
information on the location of centromeres in the karyotype of zebra 
finch, coupled with the high degree of flycatcher—zebra-finch synteny 
conservation" (Supplementary Fig. 4). This reveals considerable over- 
lap between the presumed location of flycatcher centromeres and 
divergence islands (Fig. 2a). This is particularly apparent when con- 
sidering the enrichment of divergence islands at the ends of micro- 
chromosomes. Moreover, the fact that several microchromosomes 
showed divergence peaks in both ends further suggests that at least 
some telomeric regions are highly differentiated between species. 

Limited pedigree data from multiple generations of flycatcher 
hybrid descendants demonstrates fitness reduction and suggests that 
the current rate of introgression is low’. Nevertheless, genetic data 
from a few loci have previously indicated detectable levels of ongoing 
gene flow*. To further address this issue we carried out deep sequen- 
cing of 24intronic regions spread across the genome in sympatric 
population samples of the 2 species (Supplementary Methods and 
Supplementary Table 10). Assuming an isolation-migration model, 
the maximum likelihood estimate of the rate of gene flow from pied 
flycatcher to collared flycatcher was 1.7 X 10° per gene and genera- 
tion (90% posterior density distribution 0.1-2.8 x 10 °), while the rate 
for the opposite direction was much lower (4.5 X 10-°). From the 
analysis of nested models we can reject a model without gene flow 
from pied flycatcher to collared flycatcher (likelihood ratio test, 
P<0.01) and estimate the rate at N,m = 0.38 (where m is the migra- 
tion rate), or roughly one migrant every three generations. 
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If divergence islands are involved in reproductive isolation, they 
might be expected to be shielded from gene flow in areas of sympatry, 
while other genome regions may get introgressed. One way of addres- 
sing this possibility is to study the distribution of private and shared 
polymorphisms to infer differential rates of lineage sorting, such as 
those caused by variation in gene flow across the genome. We found a 
very clear pattern at the point at which the proportion of shared poly- 
morphisms drops significantly in all divergence islands, from a mean 
background level of 32.8% to 18.3% in islands (Wilcoxon test, 
W = 632,244, P<0.001; Fig. 2b and Supplementary Fig. 8). This 
observation indicates more advanced lineage sorting within than out- 
side regions of elevated divergence, and is consistent with a role 
for gene flow in homogenizing background levels of divergence. 
Moreover, we found that the proportion of private polymorphisms 
was significantly higher in divergence islands than elsewhere in the 
genome (pied flycatcher: 35.3% in non-islands versus 56.1% in islands, 
W = 319,712, P< 0.001; collared flycatcher: 60.2% in non- islands vs. 
75.4% in islands, W = 608,994, P< 0.001; Fig. 2b and Supplementary 
Fig. 8), consistent with restricted gene flow in islands. Furthermore, the 
genomic background level of the proportion of private polymorphisms 
was considerably higher in collared flycatcher than in pied flycatcher 
(W = 19,001,201, P< 0.001, Fig. 2b and Supplementary Fig. 8), in 
agreement with the direction of gene flow recorded. 

Birds have female heterogamety (males, ZZ; females, ZW) and as all 
sequencing was carried out using male birds, read coverage is expected 
to be similar for autosomes and the Z chromosome. Estimates of 
diversity and divergence should therefore be directly comparable 
between chromosome categories. The Z chromosome showed greater 
than sevenfold higher mean divergence (d¢ , 0.0016 + 0.00060) than 
autosomes (0.00022 + 0.00036, W = 23,706,977, P<2 X10 '°) and 
significantly higher Fsr (0.623+0.076 versus 0.350 + 0.110, 
W = 23,274,298, P<2X 10 1°) (Fig. 3). Divergence was more uni- 
formly distributed along the Z chromosome and did not show the 
distinct islands of divergence characterizing most autosomes; the Z 
chromosome contained approximately 35% of all fixed sites in the 
genome. Moreover, estimates of z, Tajima’s D and 1° (an estimate of 
linkage disequilibrium) were also more uniform along the Z chro- 
mosome (Fig. 3 and Supplementary Fig. 8). Despite the higher mean 
divergence, we note that divergence at individual windows on the Z 
chromosome did not exceed that within autosomal divergence islands. 
High sex-linked divergence is thus a consequence of increased back- 
ground level rather than more extreme divergence in individual 
regions. Reports of a disproportionately large effect of the X chro- 
mosome on hybrid sterility and of reduced introgression of sex-linked 
genes” have fed the idea that sex chromosomes are particularly 
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Figure 3 | Contrasting levels of divergence and diversity between the Z 
chromosome and a similarly sized autosome (chromosome 1A). The plots 
show the density of fixed differences per kb (d,), Fsr, nucleotide diversity (z; 
collared flycatcher in blue and pied flycatcher in green) and the proportion of 
shared polymorphisms. 
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important for the build-up of reproductive isolation. This is supported 
by data from female heterogametic organisms”, including higher Fsr 
seen for a handful of Z-linked markers than for autosomal markers in 
flycatchers*". In flycatchers, mating patterns also suggest sex-linkage of 
male plumage traits and species recognition’, traits that may evolve 
under the influence of divergent selection. Our observations could be 
taken to suggest that natural or sexual selection at multiple loci assoc- 
iated with reproductive isolation on the Z chromosome has erased the 
signal from individual divergence islands by broadly increasing diver- 
gence to a higher background level. From this perspective, the Z chro- 
mosome could be seen to represent a more advanced stage of species 
differentiation, with islands turning into plateaus or divergence hitch- 
hiking turning into genome hitchhiking”. We note in this context that 
total sequence differentiation in interspecific comparisons was higher 
for the Z chromosome (mean d,y = 0.0057 + 0.0011) than for auto- 
somes (0.0045 + 0.0010; W = 19,256,489, P< 2X10 '°). Moreover, 
more advanced divergence of the Z chromosome compared to auto- 
somes was also supported by a significantly lower proportion of shared 
polymorphisms in the former (15.2%) than among the latter (32.3%; 
W = 1,408,692, P< < 0.001) (Fig. 3). 

The 50 regions defined as divergence peaks contained a total of 530 
protein-coding genes. An assessment of gene ontology among these 
genes did not reveal any functional category to be significantly over- 
represented (Supplementary Table 11). Moreover, we found no indica- 
tion that proteins encoded by genes within divergence peaks would be 
faster evolving than other proteins in the genome (general linearized 
model of dy/ds, where dy is the rate of non-synonymous substitution, 
including chromosome length; peak versus non-peak; z= 0.837, 
P= 0.403). As many peak regions contained more than one gene it 
is possible that unrelated features of linked genes blur characteristics 
common to genes under selection. However, we found that genes 
differentially expressed between species were significantly more com- 
mon in peak regions (246 out of 346 genes; 71.1%) than in the rest of 
the genome (4,180 out of 7,134 genes, 58.6%; ~? =11.2, P<10°°; 
Supplementary Table 12). One possible explanation to this observation 
is that standing variation at cis-acting regulatory elements in an ances- 
tral population has segregated via linkage to loci under divergent 
selection in peak regions. 

The collared flycatcher and the pied flycatcher probably started to 
diverge in allopatry in glacial refugia of the Mediterranean area during 
the Pleistocene epoch, candidate regions being the Iberian and 
Apennine peninsulas, respectively. Subsequent secondary contact dur- 
ing repeated cycles of interglacial periods allowed gene flow, just as 
hybridization and gene flow occurs in contemporary areas of sympa- 
try. According to this scenario, allopatric divergence may have been 
followed repeatedly by genomic homogenization in sympatry. The 
highly heterogeneous nature of genomic divergence between the two 
species is compatible with such a model, with some genomic regions 
refractory to gene flow. Our data show that these regions are localized, 
numerous, diverged far beyond the background level and present on 
almost all chromosomes, essentially shedding light on several of the 
central questions on the genomic landscape of species divergence. The 
consistent observation in both flycatcher species of reduced diversity 
in divergence islands would suggest that the same loci, or closely linked 
loci, have been subject to directional selection in both lineages inde- 
pendently. This, together with the juxtaposition of the most extreme 
divergence and gaps in the genome assembly, raise the possibility that 
centromeres or other heterochromatic repeats themselves actually are 
drivers of species divergence. The meiotic drive model of speciation 
invokes an arms race between centromeric alleles for deposition into 
the single resultant oocyte of female meiosis”, in which selection acts 
on allelic variation in the ability to attract microtubuli of an asymmet- 
ric spindle pole. This could lead to rapid evolution of repeat sequences 
as well as of proteins involved with spindle-fibre attachment to cen- 
tromeres, possibilities that are supported by empirical data’*, and 
may hinder proper chromosome segregation or pairing during hybrid 
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meiosis**. Interestingly, the Drosophila Zhr locus causing female 
lethality is itself a heterochromatic satellite- DNA block’, and the 
Drosophila ods-site homeobox (OdsH) speciation gene has been linked 
to hybrid male sterility through its binding to evolutionary labile het- 
erochromatic repeats’. Similar actions are suggested for other spe- 
ciation genes’*. There is evidence for segregation distortion in 
chicken chromosomes involving loci with centromeric or telomeric 
locations*. The genomic distribution of divergence peaks in flycatch- 
ers is compatible with an involvement of telomeres as well, as some 
chromosomes showed divergence signals in both ends. Telomeres 
have been shown to have an evolutionary conserved role during mei- 
osis in which they cluster on the nuclear envelope, forming a ‘telomere 
bouquet’, and enable chromosome movements to promote homolog- 
ous synapsis”. It is noteworthy in this respect that birds have more 
extensive arrays of telomeric repeats than other vertebrates, and show 
structural polymorphism of telomeres within species*’, setting the 
stage for a meiotic drive also in these types of repeats. As meiotic drives 
are characterized by repeated episodes of selection, this would be 
compatible with the relative large size of divergence islands observed. 

To conclude, this study presents the genome sequence of an avian 
speciation model and unravels the genomic landscape of species diver- 
gence in unprecedented detail. The results show strong heterogeneity 
in sequence differentiation in a species pair in which lineage sorting is 
incomplete. The potential connection of species divergence to key 
repetitive elements of chromosomes calls for a shift in focus, with 
the quest for genetic basis of reproductive isolation extended to include 
sequences other than protein-coding genes. For further dissection of 
the mechanism driving species divergence in this and other systems it 
will be important to obtain detailed maps of how the rate of recom- 
bination varies along chromosomes, based on large-scale genotyping 
in pedigrees. Together with modelling (under varying intensity and 
character of selection), this can address to what extent sweeps or 
background selection, possibly aided by low recombination, are 
expected to increase divergence and over what distances. As the size 
of genomic islands of divergence will also be affected by variation in N, 
and the rate of migration, these are also factors that need to be inte- 
grated in models. Moreover, extensive genotyping in pedigrees would 
be a means to test for segregation distortion introduced by meiotic 
drives. 


METHODS SUMMARY 


Genome sequencing was carried out with Illumina technology using DNA from 
a single wild-caught male collared flycatcher. Sequences from paired-end and 
mate-pair reads of multiple libraries (200-21,000 bp) were assembled using 
SOAPDENOVO in subsequent steps with increasing insert size of libraries. 
Scaffolds were physically anchored to chromosomes by the aid of a collared- 
flycatcher linkage map and with comparative map information from the zebra- 
finch genome. Protein-coding genes of the flycatcher genome were retrieved 
through a combination of mapping reads to zebra-finch gene templates, using 
flycatcher expressed sequence tag (EST) evidence and ab initio prediction. Levels 
of gene expression were measured across a suite of tissues (embryonic, adult 
somatic and gonadal tissues) using RNA-seq with Illumina technology, and 
differentially expressed genes were identified with BAYSEQ. Population genomic 
analyses were based on data from re-sequencing of 10 individuals each of collared 
flycatcher and pied flycatcher, in which reads were mapped to the assembly using 
BWA (Burrows-Wheeler Aligner) software. After analysis and data processing 
using a combination of software tools, sequence variants were identified with 
the Genome Analysis Toolkit (GATK; Broad Institute). Divergence and diversity 
parameters were estimated using ‘haploidized’ data by randomly choosing one 
allele from heterozygous genotypes. 
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A map of visual space in the primate entorhinal cortex 


Nathaniel J. Killian!?, Michael J. Jutras! & Elizabeth A. Buffalo’? 


Place-modulated activity among neurons in the hippocampal 
formation presents a means to organize contextual information 
in the service of memory formation and recall’?. One particular 
spatial representation, that of grid cells, has been observed in the 
entorhinal cortex (EC) of rats and bats*°, but has yet to be 
described in single units in primates. Here we examined spatial 
representations in the EC of head-fixed monkeys performing a 
free-viewing visual memory task®’. Individual neurons were iden- 
tified in the primate EC that emitted action potentials when the 
monkey fixated multiple discrete locations in the visual field in each 
of many sequentially presented complex images. These firing fields 
possessed spatial periodicity similar to a triangular tiling with a 
corresponding well-defined hexagonal structure in the spatial auto- 
correlation. Further, these neurons showed theta-band oscillatory 
activity and changing spatial scale as a function of distance from 
the rhinal sulcus, which is consistent with previous findings in 
rodents**"°. These spatial representations may provide a frame- 
work to anchor the encoding of stimulus content in a complex 
visual scene. Together, our results provide a direct demonstration 
of grid cells in the primate and suggest that EC neurons encode 
space during visual exploration, even without locomotion. 

The primate hippocampal formation has been shown to represent 
both egocentric spatial information, with cells responding to self- 
motion and head direction, as well as allocentric spatial information, 
with cells firing more spikes when a particular region of an environment 
is viewed“. The presence of allocentric spatial view cells in the hip- 
pocampus, analogous to rodent place cells, suggests that the primate 
entorhinal cortex (EC) may also represent space independent of the 
position of the animal. Indeed, the non-retinocentric, bilateral receptive 
fields of primate EC neurons allow for the possibility of regular and 
allocentric spatial firing within the EC'*. Furthermore, a recent study 
has provided evidence for grid cells based on human neuronal popu- 
lation responses in the EC that are periodic with respect to position in 
virtual space’®. To examine spatial representations in the primate hippo- 
campal formation, we recorded the activity of 342 neurons from the EC 
and hippocampus of three monkeys performing a free-viewing visual 
recognition memory task, the visual preferential looking task (VPLT)°*”. 
Novel images were each presented twice on a computer monitor with a 
fixed reference frame, and gaze location and neuronal data were recorded 
simultaneously. Images consisted of photographs that contained a wide 
variety of elements, including abstract art, animals, landscapes and peo- 
ple, and the monkeys explored the static images with a dynamic sequence 
of fixations (Fig. 1a). On average, EC neurons gave enhanced responses 
to stimulus presentation and many demonstrated a reduced firing rate 
for repeat presentations, consistent with previous findings of match 
suppression (Fig. 2)'°. Neurons in the hippocampus exhibited a more 
diverse range of suppressed and excited visual and recognition res- 
ponses®. All of these responses were contained within the analysed 
data; we included data segments in the analyses only if the monkey 
was actively exploring an image on the screen (pre-stimulus fixation 
periods and viewing outside of the image bounds were not included). 

In the rat EC, grid cells are found exclusively in the medial EC 
(MEC). Spatial and visual input to the EC is topographically organized 


such that spatial information reaches the MEC through input from the 
postrhinal cortex (the parahippocampal cortex in primates), while 
object information reaches lateral EC (LEC) via the perirhinal cortex”. 
In monkeys, roughly the posterior half of the EC receives visuospatial 
information from parahippocampal cortex, retrosplenial cortex and 
presubiculum, while the anterior half receives visual object information 
from the perirhinal cortex’*. Projections from EC to the hippocampus 
also havea similar anatomical organization in rats and monkeys, but the 
anatomical axes are oriented differently. The anterior EC in the monkey 
and LEC in the rat project to the region around the border between CA1 
and subiculum, whereas the posterior EC in the monkey and MEC in 
the rat project to proximal CA1 and distal subiculum”’. Based on the 
topography of these inputs, we would expect the rat MEC and LEC to 
correspond to the monkey posterior and anterior EC, respectively. 

The spatial firing fields of grid cells can be thought of as representing 
the nodes of a triangular grid, and it follows that the spatial auto- 
correlation of the firing-rate map should have a well-defined hexagon 
surrounding a central peak. A gridness score has been used to quantify 
the strength of 60-degree rotational symmetry that leads to this hexa- 
gonal structure (Supplementary Fig. 1)'°”°. The gridness of all recorded 
units was evaluated and surrogate data were used to determine signifi- 
cant gridness scores. Grid cells were classified as those having a grid- 
ness score above the 95th percentile of the scores for 100 time-shifted 
permutations of the spike timings (Fig. 1b, e). The number of grid cells 
identified in the posterior EC (23 out of 193; 11.9%) was significantly 
greater than that expected by chance (P < 0.0001, ¥7(19.4,1)) (Fig. 1b, 
c; see also Supplementary Figs 2a and 3). Importantly, the spatial 
density of gaze location did not produce high gridness scores (Sup- 
plementary Fig. 2b), and across the population, the grid scores for EC 
grid cells were consistently higher than grid scores calculated for eye 
movements (P< 1X 10 °, Wilcoxon rank-sum test; Supplementary 
Fig. 2b). Because these analyses were computed across presentations of 
multiple complex stimuli, these data suggest that the grid-cell repres- 
entation is not specific to stimulus content. To address this explicitly, 
we examined the reliability of these representations by comparing the 
rate maps generated from the first and second halves of each session. 
There was a significant positive correlation between these firing rate 
maps across the population of grid cells (P<1X10-°, n= 23, 
Wilcoxon signed-rank test), which suggests that these representations 
are stable and stimulus-independent (see Supplementary Fig. 4). 

The proportion of grid cells identified in the hippocampus (4 out of 
119; 3.4%) was not significantly different from chance (P= 0.41, 
x’ (0.67,1)) and was significantly less than the number of EC grid cells 
(P = 0.009, x7(6.82,1)) (Fig. 1c). An important caveat is that our hip- 
pocampal recordings were all taken from the anterior hippocampus’, 
which is presumably comparable to the temporal hippocampus in 
rats, containing cells with larger place fields than in septal regions. 
Accordingly, it is difficult to compare the scale of the spatial represen- 
tations between the EC and the hippocampus with the present data. 

Wealso found a significant population of neurons in the EC, but not 
the hippocampus, with increased firing rate near the edges of the 
stimuli, quantified using a border score as described in ref. 5 (Fig. 1c, 
e) (EC, P= 0.0058, y°(7.61,1); 18 out of 193 (9.3%); hippocampus, 


Yerkes National Primate Research Center, 954 Gatewood Road, Atlanta, Georgia 30329, USA. *Wallace H. Coulter Department of Biomedical Engineering at the Georgia Institute of Technology and Emory 
University, 313 Ferst Drive, Atlanta, Georgia 30332, USA. 3Department of Neurology, Emory University School of Medicine, 1440 Clifton Road, Atlanta, Georgia 30322, USA. 


29 NOVEMBER 2012 | VOL 491 | NATURE | 761 


©2012 Macmillan Publishers Limited. All rights reserved 


LETTER 


d 


Cells (%) 


[op] 


j 


(mm) 


ees a ns. 


MP_1213.07c 


6 dv.a. 12 dv.a. 


1mm,5.2 deg 2.5mm,6.5 deg 3.5mm, 8.1 deg 


ge 22 0.567, P= + Pe nag 


TT_209.06a MP_604.05a MP_614.08a 


Border score 


Figure 1 | Spatial representation in the primate entorhinal cortex. 

a, Recordings were carried out using a linear electrode array placed in the 
entorhinal cortex (red arrow). Three example 10-s scan paths are shown in 
yellow. b, An example of an entorhinal grid cell. Left, plots of eye position (grey) 
and spikes (red) reveal non-uniform spatial density of spiking. For clarity, only 
spikes corresponding to locations of firing rate above half of the mean rate were 
plotted. The monkey’s name and unit number are indicated at the top. Middle, 
spatial firing-rate maps show multiple distinct firing fields. Maps are colour 
coded from low (blue) to high (red) firing rates. The maximum firing rate of the 
map is indicated at the top. Right, the spatial periodicity of the firing fields 
shown against spatial autocorrelations. The colour scale limits are + 1 (blue to 
red), with green being 0 correlation. d.v.a., degrees of visual angle; g, gridness 


P=0.1995, ¥7(1.65,1), 9 out of 119 (7.6%); see Supplementary 
Information for a detailed description of methods). The presence of 
cells that represent spatial locations relative to the stimulus bounds 
independent of stimulus content would suggest that representations of 
objects within an environment may be anchored to a consistent frame- 
work, with the stimulus bounds serving as landmarks. However, it is 
possible that these neurons simply have oblong view fields near one or 
more edges of the image-presentation region. This would need to be 
examined in future studies with changing image frames. 

In the rat and bat EC, grid cells increase in field spacing in the 
dorsomedial to ventrolateral direction, as the distance from the border 
of the MEC with the postrhinal cortex increases**. This gradient 
mirrors the increase in the size of hippocampal place fields along the 
dorsoventral axis*’, to which the MEC provides input in a topograph- 
ical manner’’”’. In the monkey, cells located in lateral EC (close to the 
rhinal sulcus) project to posterior levels of the hippocampus, whereas 
cells located in the medial EC project to more anterior levels of the 
hippocampus’. In the present study, firing-field spacing was signifi- 
cantly correlated with the distance from the rhinal sulcus for each 
monkey, and for both monkeys together (Spearman’s rank correlation 
coefficient: p = 0.671, P = 0.024 for monkey MP; p = 0.665, P = 0.026 
for monkey TT; p = 0.567, P = 0.006 for both monkeys) (Fig. 1d; see 
also Supplementary Fig. 5). 
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score. ¢, Percentages of cells in the EC and hippocampus (HPC) with a 
significant gridness score (G) or border score (B). The black line shows the 5% 
chance level, the dashed lines represent the 95% confidence level. *P < 0.05, 
**P < 0.01. d, Left, grid-cell spacing increased with distance from the rhinal 
sulcus (r.s.), consistent with a dorsal—ventral gradient in rodents and bats. Open 
and closed circles identify the grid cells from each of the two monkeys. Right, 
autocorrelations for representative grid cells recorded at different locations 
medial to the rhinal sulcus. The monkeys’ names and unit numbers are 
indicated at the bottom. e, Gridness and border scores are plotted for all cells 
recorded in the posterior EC (n = 193; red, cells with significant gridness scores, 
n = 23; blue, cells with significant border scores, n = 18). 


In addition to spatial representations, we examined neuronal res- 
ponses in the EC that might underlie recognition memory. Consis- 
tent with the input from perirhinal cortex to anterior EC, we found 
that neurons in anterior EC showed stronger visual and memory res- 
ponses than did neurons in posterior EC. Specifically, neurons 
recorded at more anterior locations were more likely to be responsive 
to visual stimuli (likelihood ratio test, P = 0.0062, ¥7(7.5,1)); and at 
progressively more anterior locations, larger proportions of these 
visually responsive neurons showed a reduction in firing rate for 
repeated stimuli, that is, ‘repetition suppression’ (likelihood ratio test, 
P=6.67 X 10°, ¥7(33.63,1)) (Supplementary Fig. 6). Furthermore, 
among the neurons displaying significant repetition suppression, the 
relative decrease in firing rate for repeat presentations was greater at 
more anterior locations (Fig. 2). Interestingly, the recognition memory 
and spatial representations were somewhat independent of each other; 
at more anterior recording locations, grid cells were more likely to 
also exhibit a memory response (likelihood ratio test, P = 0.0034, 
7(8.55,1)), and the magnitude of firing-rate reduction was in line with 
the rest of the population of cells. However, there was a gradual decline 
in the percentage of grid cells at more anterior locations, and no grid 
cells were found in any of 3 separate penetrations in front of the 
posterior 50% of the EC, suggesting a functional border between pos- 
terior and anterior EC. Because most of our recordings were in the 
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Figure 2 | Recognition memory and conjunctive grid-memory cells. The 
strength of the memory response (percentage decrease in rate for repeated 
stimuli) increased in more anterior recording locations (n = 85 cells with a 
significant recognition memory response). Grid cells (large black circles) were 
more likely to show a recognition memory response at more anterior locations 
(the number of grid cells without a significant memory response is given 
below the cluster of points at each location). Neuronal responses to novel and 
repeat presentations are shown for some representative neurons (right and 
bottom panels; mean ~ s.e.m.). Grey shading indicates a time region of 
significant decrease in firing rate for repeat presentations (P < 0.025; 
Supplementary Methods). The monkeys’ names and unit numbers are 
indicated above the graphs. 


posterior EC, an important target of future studies will be to further 
characterize responses throughout the anterior EC. 

EC layers were classified using current-source density estimates (see 
Supplementary Methods). Grid cells were found at an approximately 
equal frequency across both superficial and deep layers (14 out of 125 
in superficial layers and 9 out of 68 in deep layers), suggesting that grid 
cells have a role in processing both input to and output from the 
hippocampus. Because many grid cells in layers deeper than layer II 
are modulated by head direction in rodents, it is possible that recording 
with the monkey’s head positioned in variable directions would 
increase these percentages. In addition, the physical restriction of using 
dorsal to ventral penetrations precludes higher sampling of the thin 
layer II that possesses the largest percentage of grid cells in rodents. 

As theta-band modulation is critical to certain models that describe 
the generation of grid cells”, we next examined theta-band oscillatory 
activity in individual EC grid cells and in the simultaneously recorded 
local-field potential (LFP) on an adjacent electrode. We found that the 
EC exhibits intermittent bouts of theta oscillations similar to the theta 
bouts described previously in the primate hippocampus (Fig. 3a)**”. 
These bouts, detected during blocks of image presentations, had a mean 
duration of 1.09 + 0.86 s and a mean inter-bout interval of 0.95 + 0.94 
(mean = s.d.). We also found that across the recording session, the grid 
cells were phase-locked to near the trough of the LFP theta (Fig. 3b) and 
the spike trains of 13 out of 23 grid cells (57%) were theta-modulated 
(Fig. 3c), consistent with findings in rodents**’’. Theta modulation and 
phase locking were not limited to grid cells but were observed in the 
population of EC and HPC neurons as a whole (Supplementary Fig. 7c, 
d). During image viewing, saccadic eye movements were made at a 
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Figure 3 | Theta bouts and theta modulation of grid cells. a, Top, 5-s raw 
LFP trace with theta (3-12-Hz) bouts indicated in grey. Bottom, spectrogram 
showing the power (tV”) of the top LFP trace. b, Grid cells were phase locked to 
LFP theta (60-degree bins, red curve, mean + s.e.m.; dashed curve, prototype 
theta oscillation, two cycles are shown). c, Autocorrelation showing theta 
modulation of the spike train of a grid cell (theta index = 11.3 at 4.2 Hz). 

d, Histogram of the saccade rate during image viewings in 19 sessions 

(4.16 + 1.49 Hz, mean + s.d.; dashed grey line indicates the median, 3.92). 

e, Saccade times (1 = 17 sessions) resulted in lower theta indices than the grid- 
cell spike trains (n = 20) (P< 0.01, Wilcoxon rank-sum test). 


median rate of 3.92 Hz (Fig. 3d), but the precise timing of saccades 
was not theta modulated across the session, suggesting that the theta 
modulation of grid cells was not due to rhythmicity in eye movements 
(Fig. 3e). Firing-rate maps for bout periods were correlated with non- 
bout periods (P< 0.01, Wilcoxon signed-rank test) and we did not 
find a significant difference in the grid pattern between bouts and 
non-bouts (Supplementary Fig. 8). Future experiments involving dis- 
ruption of theta would aid in understanding the relationship between 
primate theta and spatial firing patterns”'®. Importantly, the present 
data demonstrate theta-band modulation among EC grid cells, even in 
a species with non-continuous theta. This information is critical for 
informing computational models and increasing our understanding of 
the generation of these spatial representations~”*. 

Taken together, these data provide evidence for grid cells in the 
primate entorhinal cortex. These spatial responses are similar in many 
respects to grid cells that have been described previously in the rat 
and bat during locomotion, but were identified with a distinct method 
of sampling the environment, that is, visual exploration through 
eye movements. These results suggest that spatial representations in 
primates can arise during visual exploration at a distance, without 
requiring an actual visit to that place. Accordingly, these data suggest 
that theories of spatial representation in the context of navigation 
could be applied to visual exploration. Because these experiments were 
performed with the monkey’s head and the visual stimulus in a given, 
fixed position, we are not able to identify whether these grid cells 
represent allocentric or egocentric spatial locations. Notably, although 
grid-cell spiking was theta modulated, we found significant evidence 
for grid cells in the absence of continuous theta-band oscillations in the 
LFP. These data suggest that current models of grid cells based on 
interactions of oscillations will need to be adapted to account for both 
intermittent theta and exploration through saccadic movements. One 
possibility is that saccades produce a theta-band phase shift in the LFP. 
This could modulate the theta-band frequency in a way that might be 
comparable to the modulation in frequency that occurs through move- 
ment velocity in rodents. It is also likely that an optimal model will 
include aspects of current oscillatory interaction and network attractor 
models”. These results provide a potential challenge to the view that 
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grid cells support path integration by combining self motion and 
environment cues’; however, it will be important to determine whether 
corollary discharge signals that enable the monitoring of eye move- 
ment commands might contribute significantly to the generation of 
grid fields**. It is possible that the spatial representations identified 
here reflect an integration of environmental context and the mag- 
nitude of eye movements, but this will need to be carefully examined 
in future experiments. 


METHODS SUMMARY 


In each VPLT session, 200 novel images were each presented twice on a computer 
monitor in a random order, with the images appearing at the centre of the screen 
and subtending 11 X 11 degrees of visual angle from the monkey’s perspective. 
Monkeys were head fixed and seated in a chair with the centre of the monitor 
aligned to their neutral eye position. An image was removed after the monkey 
looked outside of the image bounds, or after 5 s, whichever was shorter. In a second 
version of the task that was used to record from 7 EC units, images covered the 
entire viewable region of the monitor (33 X 25 degrees of visual angle). In this 
version, each of 36 novel scenes was shown twice and a total of 10s of visual 
exploration was required for each presentation. Gaze location was recorded with 
an infrared eye-tracking system (ISCAN). We recorded spikes (250-8,000 Hz) and 
LFPs (0.7-170 Hz) from the EC with a laminar electrode array mounted on a 
tungsten microelectrode (12-site, 150-j1m spacing; FHC). Rate maps were com- 
puted with a Gaussian smoothing procedure”. Gridness scores and border scores 
were calculated using standard equations and significance was evaluated with a 
standard shuffling procedure*”’. Theta-band bouts in the LFP were detected by 
statistically analysing the power spectrum over time”. Theta-band modulation of 
grid cells was analysed with a standard autocorrelogram power spectrum metric, 
the theta index”’. All experiments were carried out in accordance with protocols 
approved by the Emory University Institutional Animal Care and Use Committee. 
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Self-renewal of embryonic-stem-cell-derived 
progenitors by organ-matched mesenchyme 


Julie B. Sneddon'*, Malgorzata Borowiak'* & Douglas A. Melton! 


One goal of regenerative medicine, to use stem cells to replace cells 
lost by injury or disease, depends on producing an excess of the 
relevant cell for study or transplantation. To this end, the stepwise 
differentiation of stem cells into specialized derivatives has been 
successful for some cell types’*, but a major problem remains the 
inefficient conversion of cells from one stage of differentiation to 
the next. If specialized cells are to be produced in large numbers it 
will be necessary to expand progenitor cells, without differentiation, 
at some steps of the process. Using the pancreatic lineage as a model 
for embryonic-stem-cell differentiation, we demonstrate that this is 
a solvable problem. Co-culture with organ-matched mesenchyme 
permits proliferation and self-renewal of progenitors, without dif- 
ferentiation, and enables an expansion of more than a million-fold 
for human endodermal cells with full retention of their develop- 
mental potential. This effect is specific both to the mesenchymal cell 
and to the progenitor being amplified. Progenitors that have been 
serially expanded on mesenchyme give rise to glucose-sensing, 
insulin-secreting cells when transplanted in vivo. Theoretically, 
the identification of stage-specific renewal signals can be incorpo- 
rated into any scheme for the efficient production of large numbers 
of differentiated cells from stem cells and may therefore have wide 
application in regenerative biology. 

Several in vitro protocols have been devised to direct differentiation 
of pluripotent cells into mature cells of interest. Most successful 
approaches promote the transition of cells through a series of inter- 
mediates designed to mimic normal development’. In the pancreas, 
this entails progressing from embryonic stem cells (ESCs) (marked by 
expression of octamer-binding protein 4 (Oct4; also known as 
Pou5f1)) to definitive endoderm (marked by expression of the tran- 
scription factor SRY-box containing gene 17 (Sox17)), then pancreatic 
progenitors (marked by expression of the transcription factor pancre- 
atic and duodenal homeoboxl (Pdx1)), endocrine progenitors 
(marked by expression of the transcription factor neurogenin 3 
(Ngn3)), and finally mature B-cells (which express insulin; Fig. 1a). 
So far, most attention has focused on the signals responsible for direct- 
ing differentiation from one stage to the next. Here we focus on amp- 
lifying or renewing distinct progenitors at various steps along the 
pancreatic lineage. 

As the microenvironment has an important role in regulating 
the balance between renewal and differentiation for many pluripotent 
cells*°, we chose to co-culture ESC-derived progenitors with mesench- 
ymal or endothelial cells, both of which influence embryonic 
pancreatic development in vivo’*. Primary mesenchymal cell 
lines were established from embryonic, neonatal and adult mouse 
pancreas, intestine, liver and spleen, and from human pancreas 
(Supplementary Table 1). From a total of 68 primary samples, 16 
yielded spindle-shaped cells that could be passaged at least 10 times. 
This panel of mesenchymal cells was tested for the ability to promote 
self-renewal of two distinct and transient progenitor cells in the 
pancreatic lineage: definitive endoderm and endocrine progenitors 
(Fig. la). 
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Figure 1 | Screen for signals that expand definitive endoderm and 
endocrine progenitors. a, Schema for directed differentiation of }-cells and 
their progenitors. b, Number of Sox17-GFP™ cells or Ngn3-GFP™ cells after 
co-culture with primary mesenchyme lines (Mes1 through to Mes16), control 
endothelial cell lines (C1, C2), an epithelial cell line (C3), a fibroblast cell line 
(C4), MEFs (C5) or various ECM surfaces (ECM1, ECM2 and ECM3) for 6 
days. c, The number of cells (Sox17* and Ngn3 *) after 2 or 6 days of co-culture. 
P values were calculated using Student'’s t-test. Data represent the mean of two 
biological replicates + s.d. 
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Definitive endoderm (Sox17-positive (Sox17~) cells)? generated 
from mouse ESCs containing a Soxl17—GFP reporter and endocrine 
progenitors (Ngn3° cells)'° generated from mouse ESCs containing an 
Ngn3-GFP reporter were isolated using fluorescence activated cell 
sorting (FACS) (Supplementary Figs 1 and 2) and then co-cultured 
with the panel of mesenchymal cells, other endothelial, fibroblast or 
epithelial lineages, or various extracellular matrices (ECMs). After 
6 days in culture, Mes1 and Mes2 produced a 9.7-fold and 12.1-fold 
increase in the number of definitive endoderm cells, respectively 
(Fig. 1b). This effect is specific to the responding cell, as Ngn3°* cells 
did not substantially expand when cultured on Mes1 or Mes2 (Fig. 1b, 
c). Instead, a marked expansion of Ngn3°* cells occurs in the presence 
of two other mesenchymal lines, Mes3 and Mes4 (Fig. 1b). 

The increased number of Sox17* and Ngn3* cells (Fig. 1c) is the 
result of proliferation, not a selective survival effect, as shown by live- 
cell imaging (Fig. 2a) and phospho-histone H3 staining (Fig. 2b). 
Furthermore, the mesenchyme-mediated renewal of progenitors is 
not the result of induction of Sox17 (Fig. 2c, d), preferential attachment 
to mesenchymal cells (Supplementary Fig. 3) or a decrease in cell death 
(Supplementary Fig. 4). In contrast to co-culture with mouse embry- 
onic fibroblasts (MEFs), co-culture of Mes1 and Mes2 does not expand 
undifferentiated (Oct4*) mouse ESCs (Fig. 2e). These data suggest 
that the effects of the mesenchymal cells are specific to the responding 
population and not due to a generic mitogenic signal. 

The self-renewal signal provided by mesenchyme might be 
mediated by cell contact or by secreted signalling factors. We tested 
16 growth factors, including those expressed during endoderm 
development and those with widespread mitogenic effects, but no 
single factor or combination of factors was sufficient to reproduce 
the magnitude of effect of co-culture (Supplementary Fig. 5a). Next, 
we attempted to block the mesenchyme-mediated expansion by the 
addition ofa panel of 41 chemical inhibitors that cover a broad range of 
signalling pathways (Supplementary Fig. 5b). Some small molecules 
reduced the mesenchyme-mediated expansion by varying degrees, 
suggesting that there may be multiple pathways involved. Taken 
together, these data indicate that the expansion on mesenchyme is 
likely to be multifactorial. Future studies aimed at elucidating the 
mechanism behind mesenchyme-mediated expansion will delve 
further into the complexities of combinations of growth factors, 
extracellular matrix (ECM) proteins and chemical compounds. 

Self-renewal without differentiation has perhaps been best studied 
for ESCs, for which self-renewal is defined as the ability of a cell to 
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Figure 2 | Effects of mesenchyme are due to 
proliferation, not induction, and are specific to 
the responding cell type. a, Number of Sox17- 
GFP* cells over time during co-culture. 

b, Immunofluorescence staining of Sox17-GFP* 
cells after 48 h co-culture with Mes2 or gelatin. 

c, Number of Sox17-GFP induced from sorted 
Sox17-GFP "cells by 6 days of co-culture. 

d, Number of Sox17-GEP * cells induced from 
mouse ESCs by 6 days of co-culture. e, Number of 
Oct4-GEP* cells after co-culture. iMEF, irradiated 
MEFs; NHDF, normal human dermal fibroblasts. 
804G ECM, extracellular matrix from 804G cells. 
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repeatedly divide without loss of identity or functional potential’?”. 
We tested whether long-term self-renewal of both mouse and human 
ESC-derived endoderm can be achieved in vitro by serial passage on 
Mes1 or Mes2. We observed a 3-million-fold and 6-million-fold 
expansion of mouse Sox17* cells on Mes1 and Mes2, respectively, 
after 7 passages (Fig. 3a), and a 65-million-fold expansion of human 
Sox17*/FoxA2” cells on Mes2 after 9 passages (Fig. 3c; for data on 
mouse Sox17° cells that were successively sorted at each passage, see 
Supplementary Fig. 6). Global gene-expression analysis of mouse 
Sox17* cells expanded on Mes1 or Mes2 shows a very close con- 
cordance (R* = 0.92 and 0.96, respectively) between the average 
gene-expression level of all genes before and after 6 passages 
(Supplementary Fig. 7). The Sox17* endoderm expanded by mesench- 
yme continues to co-express markers of definitive endoderm 
(including forkhead box A2 (FoxA2) protein) and does not differenti- 
ate further, as indicated by no increase in the expression of pancreatic 
(Pdx1), intestinal (caudal type homeobox 2 (Cdx2))’* or lung (SRY- 
box containing gene 2 (Sox2))'* markers (Supplementary Fig. 8a, b). 

To address further the question of whether cellular identity is pre- 
served, we assessed the developmental potential of amplified Sox17* 
cells. Mouse ESC-derived endoderm cells, expanded on mesenchyme 
for six passages, showed no loss in their capacity for differentiation into 
pancreatic progenitors (marked by expression of Pdx1), endocrine 
progenitors (marked by expression of Ngn3) and f-like cells (marked 
by expression of C-peptide) (Supplementary Fig. 9). The efficiency 
of induction towards pancreatic lineages did not change between 
unpassaged (PO) and late passage mouse ESC (P6) or human (P8) 
ESC-derived endoderm as judged by the percentage of Pdxl*, 
Ngn3*, or C-peptide cells at each stage (Fig. 3b and 3d). Culture 
with mesenchyme thus permits mouse and human endoderm self- 
renewal, defined as long-term expansion without alteration of the 
pattern of gene expression or developmental potential. 

Finally, we subjected the expanded, human ESC-derived pancreatic 
cells to a stringent test: whether they can form insulin-expressing, 
glucose-responsive cells in vivo. The most efficient published protocols 
for in vitro differentiation of pluripotent cells to B-cells yield only a 
small percentage (typically 0-15%) of insulin-positive cells, and these 
cells do not secrete insulin in a glucose-responsive manner. Thus, to 
test physiologic potential, stem cells are differentiated in vitro to a 
progenitor stage and then implanted in vivo where they ‘mature’ to 
functional cells’. Human ESCs were differentiated in vitro to definitive 
endoderm and then expanded on mesenchyme for 3 to 7 passages 
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(Fig. 4a). This expanded endoderm was then differentiated further 
in vitro to pancreatic progenitors and endocrine progenitors. Each cell 
type (expanded endoderm, pancreatic progenitors differentiated 
from expanded endoderm and endocrine progenitors differentiated 
from expanded endoderm, as well as unpassaged controls for each 
of the respective stages) was injected under the kidney capsule of 
SCID-Beige mice and allowed to mature in vivo. Before implantation, 
an aliquot of cells was fixed and stained to assess the state of dif- 
ferentiation. As expected, very few insulin (C-peptide)-expressing cells 
were detected at the endoderm and pancreatic progenitor stages in 
vitro (before transplantation; data not shown) and few (<5%) 
C-peptide* cells were detected at the endocrine progenitor stage 
(Supplementary Fig. 10). 

All stages of human ESC-derived cells that were expanded by 
mesenchyme gave rise to Pdxl", C-peptide* (insulin-expressing) 
cells when transplanted in vivo. Immunofluorescence showed Pdx1, 
C-peptide and insulin co-expression in grafts of both unpassaged and 


Ngn3* 
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Figure 3 | Long-term expansion of 
differentiation-competent mouse and human 
ESC-derived endoderm in the presence of 
mesenchyme. a, Number of mouse Sox17-GFP* 
cells shown in relation to co-culture time. 

b, Efficiency of directed differentiation of 
unpassaged and passaged mouse Sox17-GFP* 
cells. c, Number of human Sox17° cells shown in 
relation to co-culture time. d, Efficiency of directed 
differentiation of unpassaged and passaged human 
Sox17* cells. Data represent mean of biological 
replicates + s.d. P values are based on two-tailed 
Student’s t-test. Px, passage number; 6 days 
(mouse) or 5 to 8 days (human) between passages. 
Asterisk denotes progressive loss of cells cultured 
on ECM alone. 
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passaged endoderm. Representative images for P7 passaged endoderm 
and its derivative P7 pancreatic progenitors are shown in Fig. 4b. In 
addition, human C-peptide was detected in the plasma of animals that 
received grafts of both unpassaged and passaged endoderm, pancreatic 
progenitors and endocrine progenitors (Supplementary Fig. 11). No 
C-peptide was detected in animals that received negative controls (in 
which PBS or mesenchyme alone were implanted). Human islets were 
also used as a positive control for engraftment, survival and function. 

Most importantly, the implanted pancreatic progenitors from 
unpassaged and passaged endoderm secreted human C-peptide in a 
glucose-responsive manner (Fig. 4d). Animals that had been engrafted 
with pancreatic progenitors or controls were fasted for 16h, then 
challenged by glucose injection. Negative controls included animals 
into which PBS or mesenchyme alone had been engrafted; human 
islets served as the positive control. The human islet controls show 
that this in vivo implantation assay has an inherent variability owing to 
difficulties in delivering the same number of cells to the kidney capsule, 


Figure 4 | Human ESC-derived cells expanded 
on mesenchyme give rise to insulin-expressing, 
glucose-responsive cells in vivo. a, Schematic 
depicting implantation of human ESC-derived 
progenitors. b, Immunofluorescence staining of 
human ESC-derived endoderm, passaged seven 
times on mesenchyme and engrafted for 3 months 
(top panel) or further differentiated to Pdx1* stage 
and then engrafted for 2 months (bottom panel). 
c, Glucose-tolerance test of animals implanted with 
PBS or mesenchyme only, human islets or Pdx1* 
pancreatic progenitors derived from unpassaged 
(PO), or passaged (P4 or P7) human endoderm. 

d, Fasting- and glucose-induced (45 min glucose) 
plasma human C-peptide levels. Pairs of bars 
represent two time points per animal; data 
represent mean of two technical replicates + s.d. 
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as well as their engraftment and survival. The similarity of glucose- 
stimulated insulin secretion for the ESC-derived populations and 
human islet controls is notable, given that similar numbers of both 
cell types were implanted but the human islets have a much higher 
starting proportion of mature, insulin-expressing cells compared to 
the mixed population of pancreatic progenitors. In addition, glucose- 
tolerance tests revealed that compared to control animals, animals that 
had received Pdx1*-stage pancreatic progenitors (either passaged or 
unpassaged) displayed a lower peak blood glucose, at levels similar to 
subjects that had received human islets (Fig. 4c). These in vivo 
implantation experiments provide evidence that mesenchyme-derived 
signals not only expand cells in vitro but give rise to cells that are 
physiologically relevant and functional in an in vivo context. 

During embryogenesis, specification of progenitors is followed by 
amplification and further differentiation, and the balance between the 
two is probably responsible for determining the final organ size’’. We 
show here that these two steps, renewal and differentiation, can be 
effectively uncoupled in vitro, enabling the separate control and 
manipulation of each step. This approach permits expansion of pro- 
genitors to an extent that may exceed that which occurs in normal in 
vivo development. Although we used the pancreatic lineage as a model, 
amplification of progenitors using organ-matched mesenchyme could 
be applicable for other tissue types and facilitate progress towards the 
goals of regenerative medicine. 


METHODS SUMMARY 

Progenitor-mesenchyme co-culture. Mouse or human ESCs were differentiated 
to either the definitive endoderm or endocrine progenitor stages, then cultured on 
top of mitotically inactivated, primary mesenchymal cells that had been derived 
from mouse or human tissue from various stages of development. As controls, 
progenitor cells were cultured on ECM surfaces or on mitotically inactivated 
control cell lines. The number of progenitors at the end of 6 days was assessed 
using immunofluorescence (in the case of human progenitors) and/or FACS 
analysis (when mouse ESC reporter lines were used). The following mouse ESC 
lines were used: Sox17-GFP'*'’, Ngn3-GFP”® and Oct4-GFP (Oct4—GFP ESC 
lines were derived from Oct4—GFP mice from Jackson Laboratories)'*. The human 
ESC line used was HUESS8. 

Growth factor and chemical screens. Recombinant growth factors were 
resuspended according to the manufacturer’s instructions and used at a concen- 
tration of 20ngml ‘and 50ngml’. Chemicals were resuspended in DMSO 
(dimethylsulphoxide) at 10mM concentration to prepare stock solutions. 
Human endoderm was co-cultured with Mes2 overnight, then treated with each 
compound at 141M and 10 uM final concentrations, with duplicates for both. 
After 6days in the presence of compounds, cells were fixed and stained for 
Soxl7 and FoxA2. Automated imaging and quantification were carried out 
using the ArrayScan (Cellomics), with at least 40 fields of view per well imaged 
in a 96-well plate. 

Implantation of human ESC-derived progenitors in vivo. Human ESC-derived 
progenitors from multiple stages of differentiation were cultured alone (unpas- 
saged) or co-cultured with Mes2 for 3 to 7 generations (passaged). Cells were then 
concentrated to a small volume corresponding to approximately 1.5 million cells 
in 50 pl of media for injection into the kidney capsule of SCID-Beige mice. Starting 
as early as 4 weeks post surgery, mice were administered glucose tolerance tests in 
which glucose was injected intraperitoneally (3 gkg_'). 
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Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 

Mouse ESC culture and differentiation towards the pancreatic lineage. The 
following mouse ESC lines were used: Sox17-GFP'*"”, Ngn3-GFP"® and Oct4- 
GFP (Oct4—GFP ESC lines were derived from Oct4—GFP mice from Jackson 
Laboratories)'*. Undifferentiated Sox17—GFP, Ngn3-GFP and Oct4—GFP mouse 
ESCs were maintained on irradiated mouse embryonic feeders (iMEFs) in DMEM 
(Invitrogen) supplemented with 15% defined fetal bovine serum (FBS; HyClone), 
0.1mM non-essential amino acid (NEAA), 1X Glutamax, 1X penicillin-strep- 
tomycin (all Invitrogen), 0.055 mM 2-mercaptoethanol (Sigma) and 5 X 105 units 
LIF (Chemicon), as described’®. Cells were passaged every 3 to 4 days using 0.25% 
trypsin-EDTA (Gibco). Prior to differentiation, the iMEFs were depleted by 
incubating the mouse ESC and iMEF suspension on gelatin-coated plates. After 
30 min incubation, the supernatant containing predominantly mouse ESCs was 
collected and seeded onto a new, gelatin-coated plate at 50,000 to 60,000 cells 
per cm’ in mouse ESC media supplemented with Rho-associated kinase inhibitor 
(Stemgent). After overnight incubation, the medium was switched to RPMI 1640 
(Invitrogen), 1X penicillin-streptomycin (Invitrogen) and 1x Glutamax 
(Invitrogen) for 30h. Media containing 0.2% FBS and either 100ngml' recom- 
binant Activin A (AA) and 20 ngml | Wnt3a (both from R&D Systems) or 
300nM IDE2 (ref. 16) was then added. In initial experiments, we compared 
endoderm differentiated using growth factors to endoderm differentiated with 
IDE2, and we found no noticeable differences in the expansion on mesenchyme 
(data not shown). The medium was changed every other day and at day 6, Sox17*/ 
FoxA2* cells were quantified by FACS and immunofluorescence. For purification 
of mouse ESC-derived endoderm, differentiated cells were briefly trypsinized, 
quenched with RPMI containing 5% FBS and resuspended in PBS containing 
5% FBS for purification by flow cytometry. Flow cytometric sorting was performed 
using either a FACSAria (Becton Dickinson) or a MoFlo (Dako Cytomation) 
machine. After sorting, cells were concentrated by centrifugation and resuspended 
in RPMI containing 2% FBS, 1 penicillin-streptomycin and 1x Glutamax 
(Invitrogen) before re-plating. 

Differentiation of definitive endoderm to endocrine progenitors was performed 

using growth factors and small molecules based on previous studies’? *! and 
empirical experience. Differentiation to the primitive gut tube stage was carried 
out for 2 days in RPMI 1640 supplemented with 2% FBS, 1x Glutamax, 1 
penicillin-streptomycin, 50ng ml~' FGF7 (R&D Systems) and 0.25 uM SANT 
(Tocris Biosciences). Differentiation to posterior foregut endoderm was carried 
out for 4 days in DMEM supplemented with 1X B27 (Invitrogen), 1 Glutamax, 
1X penicillin-streptomycin, 50 ng ml ' FGEF7, 0.25 uM SANT and 2 uM retinoic 
acid (Sigma Aldrich). Differentiation to pancreatic endoderm was carried out in 
DMEM supplemented with 1x B27, 1X Glutamax, 1 penicillin-streptomycin, 
50 ng ml * FGE7, 0.25 WM SANT and 100 ng ml * noggin for 4 days. Differentiation 
to endocrine progenitors was carried out using 11M Alk5 inhibitor (StemGent). 
Medium was changed every other day for the duration of differentiation. 
Human ESC culture and differentiation towards the pancreatic lineage. 
HUESS cells were maintained as described”’. In brief, undifferentiated human 
ESCs were maintained on gelatin-coated plates with iMEFs in KO-DMEM 
(Invitrogen) supplemented with 10% Plasmanate (Talecris), 10% KOSR, 
0.1mM NEAA, 1X Glutamax, 1 penicillin-streptomycin, 5 ng ml! bEGE (all 
Invitrogen) and 0.055 mM 2-mercaptoethanol (Sigma). 

Cells were passaged at a ratio of 1:6 every 5days using 0.05% trypsin 
(Invitrogen) and plated in regular HUES media supplemented with ROCK 
inhibitor, Y-27632 (EMD, 101M). To generate definitive endoderm, HUESCs 
were cultured on iMEF feeder cells until 100% confluent, then treated with 20 ng 
ml! Wnt3a (R&D systems) and 100ng ml’ AA (R&D systems) in RPMI 
(Invitrogen) supplemented with 1 L-glutamine (Invitrogen) for 1 day, and then 
100ngml-' AA in RPMI supplemented with 1X L-glutamine and 0.2% FBS 
(Invitrogen). Two days later, the medium was changed to 50ngml ' FGE7 
(R&D systems) in RPMI supplemented with 1X L-glutamine and 2% FBS, and 
maintained for an additional 2 days. Cells were then transferred to 100ng ml * 
noggin, 0.25 uM KAAD-CYC and 21M RA (Sigma) in DMEM supplemented 
with 1X L-glutamine and 1% B27 and cultured for an additional 4 days. To induce 
endocrine differentiation, cells were transferred to 100ngml | noggin,100nM 
PdBu (EMD) and 1uM Alk5 receptor tyrosine kinase inhibitor II (Axxora) in 
DMEM supplemented with 1x L-glutamine and 1% B27 for 4 days. Cells were 
then treated with 100ngml ' noggin and 1M AlkS inhibitor for additional 
3 days. All DMEM are high-glucose DMEM. 

Global gene-expression analysis by microarray. Sox17* cells were analysed 
before and after 6 passages on mesenchyme. The Sox17-GFP* cells were sorted 
from unpassaged endoderm or from mesenchyme co-cultures and collected 
directly into RLT buffer (Qiagen). Total RNA was isolated using Qiashredder 
and RNeasy Mini Kit (Qiagen). Biotinylated complementary RNA was prepared 
from =100 ng of isolated RNA using Illumina TotalPrep RNA Amplification Kit 
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(Ambion) and hybridized to the Illumina mouse genome Bead Chips (Mouse 
Ref-8). Samples were prepared as technical duplicates. Data were acquired with 
Illumina Beadstation 500 and were evaluated using BeadStudio Data Analysis 
Software (Illumina). 

Microarray data were deposited in the Gene Expression Omnibus Database of 

the National Centre for Biotechnology Information, under the accession number 
GSE25337. 
Immunofluorescence. Cultured cells were fixed with freshly made 4% para- 
formaldehyde in PBS (Sigma) for 30 min at 4°C, followed by two brief washes 
in PBS. Cells were then blocked in 5% donkey serum (Jackson Immunoresearch) 
in 0.1% PBT (PBS with 0.1% Triton X-100) for 1 h at room temperature. Primary 
antibody was applied in blocking solution for overnight incubation at 4 °C. The 
next day, cells were rinsed 3 times, for 10 min each time, with 0.1% PBT before 
application of secondary antibody for 1h at room temperature. Cells were coun- 
terstained with DAPI (Sigma Aldrich) or Hoechst 3342 (Molecular Probes) to 
visualize nuclei and washed 3 times in 0.1% PBT, for 10 min each. Images were 
acquired with an Olympus [X70 microscope. Quantification was carried out using 
MetaMorph software (Molecular Devices), except for the estimation of phospho- 
histone H3 fluorescence, which was quantified manually based on single colour 
images. 

The specific antibodies and dilutions used were as follows: primary antibodies 
were goat anti-SOX17 (1:250; R&D Systems), goat anti-HNF3beta/FOXA2 
(M-20) (1:250; Santa Cruz Biotechnology), goat anti-PDX1 (1:500; R&D 
Systems), goat anti-SOX2 (Y-17) (1:250; Santa Cruz Biotechnology), mouse 
anti-CDX2 (1:400; Biogenex), sheep anti-neurogenin 3 (1:300, R&D Systems), 
guinea pig anti-insulin (1:500, DAKO), rabbit anti-C-peptide (1:500; Linco/ 
Millipore), rabbit anti-cleaved caspase 3 (1:1,000; Cell Signaling), and rabbit 
anti-phospho-histone H3 (1:100; Millipore); and secondary antibodies were 
Alexa488 or 594-conjugated donkey anti-rabbit, Alexa488, 594 or 647-conjugated 
donkey anti-goat, Alexa488-conjugated donkey anti-mouse, Alexa488-conjugated 
goat anti-sheep. All of these antibodies were from Molecular Probes and used at a 
dilution of 1:300. 

Derivation of primary mesenchymal cells. To derive primary embryonic 
mesenchymal cells, wild-type ICR (Taconic) mouse embryos were collected each 
day, between 12.5 and 18.5 days after fertilization. At each stage, the embryonic 
pancreas was removed, and in some cases, other endoderm-derived organs 
(spleen, liver or intestine) were also taken. After dissection, each rudiment was 
rinsed briefly in PBS and kept on ice. Forceps were used to transfer the tissue to a 
well of a 96-well plate, where it was kept at 37°C for 10 to 15min to allow 
attachment of the tissue to the tissue culture plate surface. Then, ‘mesenchyme 
media’ (DMEM:F12, 10% FBS, 1X _ penicillin-streptomycin, 1X Glutamax 
(Invitrogen)) was added to just cover the tissue. Tissue was thereafter kept at 37 °C. 

Over several weeks, medium was changed twice weekly, and wells were 
monitored for growth of mesenchymal cells from the tissue. Outgrowth of 
spindle-shaped cells was observed between 1 and 3 weeks after initial tissue 
collection, with a success rate of approximately 1 in every 4 derivations giving a 
successful outgrowth. Once confluent, the mesenchymal cells were trypsinized 
(thus separated from the initial tissue in the well) using 0.25% trypsin-EDTA 
and expanded until suitable for banking in liquid nitrogen. 

To isolate adult pancreatic mesenchyme, an adult wild-type mouse pancreas 
was perfused through the common bile duct using Collagenase P and Liberase 
(both Roche). Once perfused, the pancreas was dissected, digested for 15 min at 
37 °C, quenched, and the islet-containing fraction was purified using Histopaque 
(Sigma). The fraction enriched for islets was then plated onto tissue-culture-coated 
plates. Mesenchyme preferentially grew from this culture and was expanded until 
suitable for banking in liquid nitrogen. 

Human pancreatic samples were obtained from non-diabetic adult donors 
through the National Disease Research Interchange (NDRI), in accordance with 
Institutional Review Board guidelines. Fractions enriched for either islets or acinar 
tissue were plated onto tissue-culture-coated plates and allowed to expand as with 
the adult mouse islet-derived mesenchyme (above). 

In addition to primary mesenchymal cells, the following cell lines were used: 804 
bladder carcinoma cell line*’, as well as normal human dermal fibroblasts, MS1- 
VEGF”, and bEnd.3 (ref. 25) (all from ATCC). 

The mesenchymal lines were renumbered arbitrarily after it was determined 

which lines were most effective. 
Mesenchyme-progenitor co-culture. Co-culture of mesenchyme and ESC- 
derived progenitors was accomplished as follows: first, a multi-well plate was 
coated with 0.1% gelatin for 1h at 37 °C (ECM control wells were not pre-plated 
with gelatin.) Next, the gelatin was aspirated and mesenchymal cells were plated at 
a density of approximately 33,000 cells per cm? (25,000 cells per well of a 48-well 
plate). Cells were allowed to attach overnight, at which point they were mitotically 
inactivated for 2 h (see below). 


©2012 Macmillan Publishers Limited. All rights reserved 


LETTER 


After medium was aspirated, ESC-derived progenitors (20,000 cells per well of a 
48-well plate) were plated on top of the mesenchymal lines and kept in RPMI 1640 
supplemented with 2% FBS (Hyclone). Medium was changed every 2 days until 
analysis. 

For control wells, ECM (804G conditioned medium, laminin or gelatin alone) 
were directly plated onto tissue culture plates and allowed to incubate overnight 
before progenitor cells were added. 

Mitotic inactivation of mesenchymal cell lines. Mitomycin C (Sigma) was used 
to mitotically inactivate mesenchymal cell lines. Cells were incubated in DMEM/ 
F12 (Invitrogen), 0.2% FBS (Hyclone), 1X pencilin-streptomycin, 1 Glutamax 
and 20 pg ml 1 Mitomycin C for 2 h, then washed three times with PBS. Cells were 
maintained in mesenchyme media and used for experiments within 1 week. 
Live cell imaging and analysis. Live cell imaging was performed using an 
IncuCyte machine (Essen Bioscience) and quantitation was performed with the 
accompanying commercial software. 

Assessing induction of Sox17 by mesenchyme, and specificity of responding 
cell type. To assess whether mesenchyme induces Sox17 expression, we performed 
two sets of experiments (depicted in Fig. 2). First, mESCs were differentiated to the 
DE stage, and the Sox17-GFP fraction was plated onto various surfaces (Fig. 2c). 
Co-culture with Mes1 or Mes2 had no appreciable effect compared to controls on 
the percent of Sox17-GFP* cells that emerged after 6 days in culture, as measured 
by FACS. Similarly, undifferentiated mESCs plated directly onto Mes1 or Mes2 for 
6 days did not express Sox17 at a higher rate than controls (Fig. 2d). 

Furthermore, mesenchyme-mediated renewal is specific to the responding cell 

type, as Oct4-GFP* mESCs maintained on Mes] or Mes2 do not appreciably 
expand. mESCs containing an octamer binding protein (Oct4) GFP reporter 
construct were purified by FACS and cultured on Mes1, Mes2 or controls in 
basal medium containing no leukaemia inhibitory factor (LIF) for 6 days 
(Fig. 2e). Data are from two independent experiments, each containing experi- 
mental duplicates. 
Growth factor and chemical compound screen. Recombinant growth factors 
were resuspended according to the manufacturer’s instructions and used at a 
concentration of 20ng ml‘ and 50ng ml _*. Factors were epidermal growth fac- 
tor (EGF), fibroblast growth factor 10 (FGF10), keratinocyte growth factor (KGF), 
netrin 4, bone morphogenetic protein 4 (BMP4), endothelial growth factor, hepa- 
tocyte growth factor (HGF), dorso (dorsomorphin), growth differentiation factor 
8 (GDF8), noggin, vascular endothelial growth factor (VEGF), decorin, notch, 
interleukin-15 (IL-15), interleukin 7 (IL-7) and chemokine (CXC motif) ligand 
3 (CXCL3). All the recombinant proteins were purchased from R&D Systems, 
except for dorsomorphin, which was purchased from StemGent. 

Chemicals were all resuspended in DMSO at 10 mM concentration to prepare 
stock solutions. Human endoderm was co-cultured with Mes2 overnight, then 
treated with each compound at 1 uM and 10 uM final concentrations, with dupli- 
cates for both. After 6 days in the presence of compounds, cells were fixed and 
stained for Sox17 and FoxA2. Automated imaging and quantification was carried 
out using the ArrayScan (Cellomics), with at least 40 fields of view per well imaged 
in a 96-well plate. 

Preparation of cells for injection in vivo. Human ESCs were prepared for 
implantation as follows: ESCs were transferred to gelatin and differentiated to 
definitive endoderm, pancreatic progenitors and endocrine progenitors. In par- 
allel, a pool of the definitive endoderm cells were expanded between four and seven 
passages on Mes2, then transferred onto gelatin and further differentiated to either 
the pancreatic progenitor and endocrine progenitor stages; after expansion, frac- 
tions of each stage were used for implantations. When cells were at the correct time 
for injection, they were gently dissociated using TrypLE (Invitrogen) just until they 
rounded up. When needed, a cell scraper was used to carefully detach cells from 


the dish. Cells were neutralized with RPMI and 2% FBS, then concentrated to a 
small volume corresponding to approximately 1.5 million cells in 50 pl of media 
for injection. 

For human islet controls, cells were shipped on ice, within 24h of collection 

from the patient, and then allowed to recover overnight at 37 °C in the presence of 
CMRI (Invitrogen) and 10 mM glucose in low-attachment dishes. They were then 
harvested by simple collection of media and centrifugation, which resulted in the 
same concentration of cells per volume as used for the human ESC-derived popu- 
lations (above). 
Injections of cells in vivo. In brief, saline or cells were injected into the kidney 
capsule of male SCID-Beige animals (Harlan or Charles River) that were approxi- 
mately 7 weeks old (at least 21g). Animals were anaesthetized using Avertin 
(250 mgkg') delivered intraperitoneally under aseptic conditions. The surgical 
site was shaved and disinfected with both alcohol and betadine. Using a syringe to 
deliver cells, approximately 50 ,l of volume was injected just under the capsule of 
the left kidney. Post surgery, animals were administered 5 mgkg ' carprofen for 
2 days post-operatively. Mice were housed singly and observed at least 2 to 3 times 
per week for the appearance of visible tumours. 

Starting as early as 4weeks post surgery, mice were administered glucose- 
tolerance tests (see below). Animals were euthanized approximately 4 months 
after transplant, or if tumour burden became too great, whichever came first. 
Graft tissue was dissected from euthanized mice, washed in ice-cold PBS and fixed 
in 4% PFA, placed in 30% sucrose and embedded in OCT for later sectioning and 
staining of tissue. 

All animal experiments were performed in accordance with the Harvard 
University International Animal Care and Use Committee (IACUC) regulations. 
Glucose-tolerance tests. Mice were fasted overnight (16h) with water only, in 
cages in which wire mesh flooring separated the animal from its bedding. Glucose 
was injected intraperitoneally (3 kg") and blood samples were taken from the 
tail and collected into heparin-coated tubes (Braintree Scientific) both before (TO) 
and 45 min after the injection of glucose. Blood glucose was also measured using 
an Ultra Mini glucometer (One Touch) at T= 0 and every 15 min thereafter, to 
ensure that glucose delivery had been achieved. 

After collection into heparin-coated microtubes, blood was spun briefly and the 
supernatant was taken and frozen at —80 °C until analysis. 

Levels of human C-peptide were measured using an ELISA kit specific to 
human, not mouse, C-peptide (ultrasensitive C-peptide human ELISA kit; 
Mercodia). Age-matched controls included samples from animals that had 
received saline only, mesenchyme only (negative controls) or human islets 
(positive controls), and that had been subjected to glucose-tolerance tests in 
parallel. 
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Impaired intrinsic immunity to HSV-1 in human 
iPSC-derived TLR3-deficient CNS cells 
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In the course of primary infection with herpes simplex virus 1 
(HSV-1), children with inborn errors of toll-like receptor 3 
(TLR3) immunity are prone to HSV-1 encephalitis (HSE)'*. We 
tested the hypothesis that the pathogenesis of HSE involves non- 
haematopoietic CNS-resident cells. We derived induced pluripo- 
tent stem cells (iPSCs) from the dermal fibroblasts of TLR3- and 
UNC-93B-deficient patients and from controls. These iPSCs were 
differentiated into highly purified populations of neural stem cells 
(NSCs), neurons, astrocytes and oligodendrocytes. The induction 
of interferon-f (IFN-B) and/or IFN-/1 in response to stimulation 
by the dsRNA analogue polyinosinic:polycytidylic acid (poly(I:C)) 
was dependent on TLR3 and UNC-93B in all cells tested. However, 
the induction of IFN-B and IFN-/.1 in response to HSV-1 infection 
was impaired selectively in UNC-93B-deficient neurons and oligo- 
dendrocytes. These cells were also much more susceptible to HSV-1 
infection than control cells, whereas UNC-93B-deficient NSCs and 
astrocytes were not. TLR3-deficient neurons were also found to be 
susceptible to HSV-1 infection. The rescue of UNC-93B- and TLR3- 
deficient cells with the corresponding wild-type allele showed that the 
genetic defect was the cause of the poly(I:C) and HSV-1 phenotypes. 
The viral infection phenotype was rescued further by treatment 
with exogenous IFN-a or IFN-f ( IFN-o/B) but not IFN-11. Thus, 
impaired TLR3- and UNC-93B-dependent IFN-a/B intrinsic 
immunity to HSV-1 in the CNS, in neurons and oligodendrocytes 
in particular, may underlie the pathogenesis of HSE in children with 
TLR3-pathway deficiencies. 

Childhood HSE is a rare, life-threatening, central nervous system 
(CNS)-restricted complication of primary infection with HSV-1, an 
almost ubiquitous virus that is typically innocuous*. Children with 
HSE are not unusually susceptible to other infectious agents, including 
viruses, or even to HSV-1-related diseases affecting sites other than 
the CNS*°. HSV-1 reaches the CNS from the oral and nasal epithelium, 
via the cranial nerves*. We identified autosomal recessive UNC-93B 
deficiency as the first genetic aetiology of childhood HSE’. UNC-93B is 
required for TLR3, TLR7, TLR8 and TLR9 responses’*. We then iden- 
tified autosomal-recessive or autosomal-dominant deficiencies of 
TLR3 (refs 2 and 3), TRAF3 (ref. 7), TRIF* and TBK] (ref. 9), revealing 
that childhood HSE can be due to the impairment of TLR3 immunity. 


HSV-1 produces double-stranded (dsRNA) during its replication’? 
and the dsRNA-sensing TLR3 is expressed and functional in 
non-haematopoietic (neurons, astrocytes, oligodendrocytes) and hae- 
matopoietic (microglia) CNS-resident cells, which produce IFN-B and 
IFN-A in response to TLR3 stimulation'"’’ and can be infected with 
HSV-1 in vitro’’"*". We therefore surmised that the pathogenesis 
of HSE in patients with TLR3-pathway deficiencies may involve 
impaired TLR3-dependent IFN-«/B and/or IFN-A immunity to 
HSV-1 in the CNS. 

We tested this hypothesis by generating induced pluripotent stem 
cells (iPSCs) from control, UNC-93B- and TLR3-deficient dermal 
fibroblasts (Supplementary Table 1). We first derived and fully charac- 
terized iPSC lines from a healthy child, from an HSE child with auto- 
somal-recessive complete UNC-93B deficiency’, and from a patient with 
autosomal-recessive complete TLR3 deficiency’, by reprogramming 
primary dermal fibroblasts, as described previously” (Supplementary 
Fig. 1 and Methods). A robust stemness and pluripotency profile, kar- 
yotypic integrity and patient-specific origin of the iPSCs were confirmed 
(Supplementary Fig. 1). Whole-exome sequencing for one control, one 
TLR3-deficient and two UNC-93B-deficient iPSC lines revealed more 
than 99.9% concordance with the corresponding parental fibroblast 
lines, in terms of exonic genetic variability (Supplementary Table 2). 
No synonymous or non-synonymous germline and somatic rare variants 
of any of the 21 known TLR3-pathway genes were found in parental 
fibroblast and iPSC lines, respectively (Supplementary Table 3). We also 
used two additional, previously described healthy control iPSC lines*"”” 
for subsequent experiments (Supplementary Table 1). 

We next induced the differentiation of iPSCs into all major non- 
haematopoietic CNS-resident cells, including neural stem cells (NSCs), 
neurons, oligodendrocytes and astrocytes**™*. The selective derivation of 
each individual neural lineage required a multistep iPSC-differentiation 
approach combined with fluorescence-activated cell sorting (FACS)- 
mediated cell purification (Fig. 1a). Neural differentiation of UNC- 
93B-deficient iPSCs, TLR3-deficient iPSCs, control iPSCs and control 
human embryonic stem cells (hESCs) (H9 line) was induced by dual 
SMAD inhibition*’” (Fig. 1a and Supplementary Fig. 2a). The resulting 
polarized columnar neuroepithelial structures expressed PLZF, ZO1 
and PAX6, well-known markers of neural rosettes”*. Mechanically 
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Figure 1 | Derivation and purification of CNS cells. a, Schematic diagram of 
the differentiation and purification protocols used. b, Representative phase- 
contrast images showing the morphology of the human pluripotent stem cells 
(hPSCs), neural rosettes and NPC clusters, from healthy control hESCs, healthy 
control iPSCs (C* (iPSC)) and UNC93B1~'~ iPSCs. Immunocytochemistry 
analysis revealed the expression of rosette markers (PLZF and ZO1) and a 
forebrain marker FOXGI. c, Characterization of UNC93B1 ’~ iPSC-derived 
CNS cell types. Upper panels, phase-contrast images showing the characteristic 
morphology of each type of neural cell; lower panels, immunofluorescence 


passaged rosettes retained expression of the forebrain marker FOXG1 
(Fig. 1b) and yielded neural precursor cells (NPCs) in the presence of 
FGF2 and EGF (Fig. 1b and Supplementary Fig. 2a). Through dif- 
ferential growth-factor treatment and the use of cell-type-specific 
surface markers in NPC-stage cells, we were able to isolate highly 
enriched populations of NSCs, neurons, astrocytes or oligodendrocyte 
lineage cells (Fig. lc and Supplementary Fig. 2b, c). 

The purified hESC- and iPSC-derived neuronal populations 
expressed additional lineage-specific markers (Supplementary Fig. 3a) 
and showed the electrophysiological properties of functional neurons 
(Supplementary Fig. 3b-e). The identity of the purified glial fibrillary 
acidic protein (GFAP)-expressing and O4-antigen-expressing glial 
cell populations was confirmed by global gene expression profiling 
(Supplementary Fig. 4a, b). Immunocytochemistry for stage-specific 
markers was used to identify the purified O04 ‘oligodendrocytes’ 
used throughout this study as a mostly immature (pre-myelinating) 
oligodendrocyte population (Supplementary Fig. 4c-e, Supplementary 
Table 4). Quantitative analysis showed that our preparations of NSCs, 
neurons, astrocytes and oligodendrocytes were highly pure (Fig. 1d), 
and similar gene-expression profiles were obtained for neurons 
and astrocytes derived from disease-specific and control cell lines 
(Supplementary Fig. 5). Our in vitro CNS cell-differentiation system 
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analysis for markers of neural stem cells (nestin), neurons (TUJ1), 
oligodendrocyte progenitor cells (04) and astrocytes (GFAP). 

d, Quantification of marker expression for each neural cell type derived from 
control hESCs, UNC93B1~’~ iPSCs and control iPSCs (error bars, s.e.m.). Scale 
bars represent 100 um (b), 50 jum (c; except for 04 staining). High SHH, high 
concentration of recombinant Sonic hedgehog (C25II (Cys25Ile-Ile)) at 

20 ng ml '; low SHH, low concentration of recombinant Sonic hedgehog at 
125ng ml '; N/S, noggin and $B431542 (dual SMAD inhibition). 


therefore constitutes a reliable platform for the comparative assess- 
ment of CNS cell-specific antiviral immunity. 

TLR3 expression has been documented in neurons derived in vitro 
from a human teratocarcinoma cell line’’, and in primary cells, either 
in human brain tissues in situ (neurons’’) or isolated from human 
brain ex vivo (oligodendrocytes and astrocytes'*"**), but not in human 
NSCs (self-renewing, multipotent cells responsible for generating 
neurons, astrocytes and oligodendrocytes in the CNS)”. We detected 
messenger RNA for key genes of the TLR3-responsive pathway, 
including TLR3 and UNC93B1, in all four cell types tested (Sup- 
plementary Fig. 6a-d), whether differentiated from iPSCs or hESCs. 
Wealso detected mRNAs for genes encoding key molecules in the IFN- 
responsive pathways, including the receptors for IFN-o./B and IFN-A, 
in these cells (Supplementary Fig. 6e-h). Levels of mRNA for the 
TLR3- and IFN-responsive pathway genes tested were similar, for each 
CNS cell type, between cells differentiated from control iPSCs, control 
hESCs and UNC-93B-deficient iPSCs, with the expected exception of 
UNC93B1, for which mRNA levels were lower in UNC-93B-deficient 
cells (Fig. 2a and Supplementary Fig. 6a—d). 

We then studied TLR3 responses in the same cells. IFN-41 and IFN- 
B were induced in a time-dependent manner, by stimulation with 
the non-specific TLR3 agonist poly(I:C), a dsRNA analogue, in NSCs, 
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Figure 2 | UNC-93B-dependent IFN responses to TLR3 in neurons and glial 
cells. a, UNC93B1 mRNA levels in CNS cells differentiated from hESCs froma 
healthy control (C* (hESC)) and iPSCs from a healthy control (C* (iPSC)), 
and an UNC-93B-deficient patient (UNC93B1 ~/~)_ NS, not stimulated. 

b, c, IFN-B (b) or IFN-A1 (c) mRNA induction, after 6h of poly(I:C) 
stimulation, in neurons (b) or oligodendrocytes (c) differentiated from one 
healthy control hESC line, one healthy control iPSC line, and in UNC93B1 == 
neurons (b) or oligodendrocytes (c), without lentiviral infection, or after 
infection with a lentivirus containing human wild-type UNC93B1 
(UNC93B1 '~ + UNC93B1) or a mock construct (UNC93B1 ‘~ + mock). 
d, IFN-B (upper panel) or IFN-A1 (lower panel) mRNA induction, after 4h 
(upper panel) or 6h (lower panel) of poly(I:C) stimulation, in astrocytes 
differentiated from hESCs from a healthy control, in UNC93B1 = astrocytes, 
without lentiviral infection, or after infection with a lentivirus containing 
human wild-type UNC93B1 (UNC93B1_'~ + UNC93B1) or a mock construct 
(UNC93B1_'~ + mock). Mean values = s.e.m. were calculated from three 
(a) or two (b-d) independent experiments, each carried out in duplicate. 
Analysis of variance (ANOVA) was carried out for UNC93B1 mRNA 
expression levels shown in a. When significant, Dunnett’s t-tests were 
performed for two-by-two comparisons. *P < 0.05, **P < 0.01 (comparisons 
between each control line and the patient line, for each cell type). 


neurons, oligodendrocytes and astrocytes differentiated from heal- 
thy control iPSCs or hESCs, but not in UNC-93B-deficient iPSC- 
differentiated CNS cells (Fig. 2b-d and Supplementary Fig. 6i-I). The 
induction of NF-«B1, a key IFN-inducing molecule, and that of MX1, a 
key IFN-inducible molecule, were both impaired in UNC-93B-defi- 
cient oligodendrocytes after poly(I:C) stimulation (Supplementary Fig. 
6m, n). The impaired response to poly(I:C) in UNC-93B-deficient CNS 
cells was consistent with our previous data for UNC-93B-deficient 
fibroblasts, pointing to a TLR3-dependent response to dsRNA in these 
cell types’. Moreover, impaired poly(I:C) responsiveness in UNC-93B- 
deficient neurons, oligodendrocytes and astrocytes was rescued by 
transient expression of the human UNC93B1 gene (Fig. 2b-d). Thus, 
the UNC-93B-dependent TLR3 pathway is functional in control 
human iPSC-derived NSCs, neurons, astrocytes and oligodendrocytes, 
at least for the induction of antiviral IFN-.1 and IFN-f in response to 
poly(I:C). 
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We thus set out to compare the response to HSV-1 in UNC-93B- 
deficient and control iPSC or hESC-derived CNS cells after infection 
with HSV-1 and HSV-1-GFP (green fluorescent protein)’**. Human 
NSCs and astrocytes seemed to be more susceptible to HSV-1 infection 
than neurons and oligodendrocytes, as massive HSV-1-GFP replica- 
tion was observed earlier (Supplementary Fig. 7 and data not shown). 
UNC-93B-deficient NSCs and astrocytes derived from two different 
iPSC lines also showed high levels of HSV-1-GFP replication, similar to 
those observed in the corresponding cell types derived from two con- 
trol iPSC lines and the control hESC line (Fig. 3a—c and Supplementary 
Figs 8a—c and 9a-c). Treatment of UNC-93B-deficient and control 
NSCs and astrocytes with recombinant IFN-«2B or IFN-B, but not 
IFN-A1, decreased HSV-1-GFP replication levels (Supplementary 
Figs 8a, c-fand 9a, c-g). Moreover, treatment with poly(I:C) decreased 
HSV-1-GFP replication levels in control, but not in UNC-93B- 
deficient astrocytes (Supplementary Figs 9f and 10a—d). By contrast, 
treatment with agonists of TLR9 (CpG-A or CpG-C) did not have such 
an effect (Supplementary Fig. 10a—d). 

When UNC-93B-deficient neurons from the two UNC-93B-defi- 
cient iPSC lines were infected with HSV-1-GFP, HSV-1-GFP replica- 
tion was faster, reaching higher levels than in neurons differentiated 
from four control iPSC lines or one hESC line (Fig. 3a, d, e and 
Supplementary Fig. 11a). The treatment of UNC-93B-deficient neu- 
rons with IFN-«2B or IFN-B, but not IFN-(1, rescued this phenotype 
(Supplementary Fig. 11a). Similar results were obtained with TLR3- 
deficient neurons that had been differentiated from TLR3-deficient 
iPSCs*”° (Fig. 3e and Supplementary Fig. 11b). The phenotype of 
enhanced HSV-1 replication in UNC-93B- and TLR3-deficient neu- 
rons was rescued by expression of the wild-type human UNC93B1 and 
TLR3 genes, respectively (Fig. 3f and Supplementary Fig. 11b). Finally, 
higher levels and faster replication of HSV-1-GFP were also observed 
in UNC-93B-deficient oligodendrocytes than in oligodendrocytes dif- 
ferentiated from control iPSCs or hESCs, and this phenotype was 
rescued by treatment with IFN-«2B or IFN-B, but not IFN-A1 
(Fig. 3a, g and Supplementary Fig. 11c, d). This is consistent with 
our previous finding of high susceptibility to HSV-1 and VSV in fibro- 
blasts with TLR3-pathway deficiencies, associated with an impair- 
ment of the TLR3-dependent induction of IFN-B and IFN-A, which 
can be rescued more effectively by exogenous IFN-o/ than by IFN-A1 
(refs 1-3). 

We studied further the production of IFNs and inflammatory cyto- 
kines in UNC-93B-deficient and control CNS cells after infection with 
HSV-1. UNC-93B-deficient neurons, astrocytes and oligodendrocytes 
produced normal amounts of interleukin-6 (IL-6), as shown by com- 
parison with control iPSC- or hESC-differentiated CNS cell types 
(Supplementary Fig. 12a-c). UNC-93B-deficient NSCs and astrocytes 
seemed to produce detectable but lower levels of IFN-1 than the 
control cells tested (Supplementary Fig. 12d, e), suggesting that UNC- 
93B-independent, TLR3-independent partially compensatory pathways 
may be involved in triggering IFN responses to HSV-1 in human NSCs 
and astrocytes. The induction of IFN-8 or IFN-A1 was readily observed 
in control iPSC- or hESC-differentiated neurons or oligodendrocytes, 
but was greatly impaired in UNC-93B-deficient neurons and oligoden- 
drocytes, respectively (Supplementary Fig. 12f, g). The induction of MX1 
was also impaired in UNC-93B-deficient oligodendrocytes after HSV-1 
infection (Supplementary Fig. 12h). Thus, neurons and oligodendro- 
cytes lacking UNC-93B may be highly vulnerable to HSV-1 infection 
because of an impairment of the cell-autonomous IFN-«/B or IFN-A 
immunity. 

Our findings suggest that neurons and oligodendrocytes provide 
strong, intrinsic protective anti-HSV-1 immunity in the CNS, through 
an intact TLR3 pathway. Although NSCs and astrocytes lacking UNC- 
93B are not more susceptible to HSV-1 infection than control cells 
in vitro, they may have a role in protective anti- HSV-1 immunity in the 
CNS in vivo. Indeed, mouse astrocytes rely on TLR3 to control HSV-2 
(ref. 29). Haematopoietic cell-derived microglial cells, which also 
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Figure 3 | High HSV-1 susceptibility in UNC-93B-deficient neurons and 
oligodendrocytes. Infection with HSV-1-GFP for 24h, at a multiplicity of 
infection of 1, was carried out. a, GFP expression in CNS cells differentiated from 
UNC93B1 ‘~ iPSCs or from hESCs from a healthy control (C*) was visualized 
using fluorescence microscopy. Phase-contrast photomicrographs from the same 
view are also shown. b-d GFP expression in NSCs (b), astrocytes (c) and neurons 
(d) differentiated from UNC93B1~/~ iPSCs, from hESCs from a healthy control 
(C* (hESC)) and one to two iPSC lines each from up to three healthy controls 
(C* (iPSC)), was assessed with a fluorescence plate reader. The difference in GFP 
intensity between HSV-1-GFP-infected cells and uninfected cells is shown. 

e, GFP expression in neurons differentiated from two lines (C and H) of 
UNC93B1 ‘~ iPSCs, one line of TLR3 ‘~ iPSCs, a total of four iPSC lines from 
three healthy controls, one or two lines from each, or from one C* hESCs line. 


express TLR3 and can be infected with HSV-1 (refs 12, 14, 17), may 
also contribute to HSE. CNS-intrinsic mechanisms are thus vital for 
the control of HSV-1 in the course of primary infection in childhood. 
These new findings add to our previous results that indicated that most 
leukocytes and keratinocytes from HSE patients with TLR3-pathway 
deficiencies respond normally to stimulation with poly(I:C) or HSV-1 
(refs 2, 3 and 9), and are consistent with the CNS-restricted pattern of 
lesions during childhood HSE, with no disseminated disease. Human 
non-haematopoietic cells may be the key to survival during the course 
of primary infection, extending the concepts of host defence beyond 
innate and adaptive haematopoietic immunity, to non-haematopoietic 
‘intrinsic immunity”. 


METHODS SUMMARY 

Human iPSC generation. Fibroblasts from patients or controls were transduced 
with the polycistronic stem cell cassette (STEMCCA) lentiviral vector and cultured 
in DMEM medium supplemented with 10% fetal calf serum (FCS), 2mM 
L-glutamine, penicillin (50U ml!) and streptomycin (50 pg ml‘) as described 
previously”. After 72 h, cells were transferred onto irradiated mouse embryonic 
fibroblasts (iMEFs) and the medium was replaced with hESC medium. iPSC 
colonies with an ESC-like morphology were mechanically isolated 4 to 5 weeks 
after infection. 

Neural differentiation. We adapted previously described protocols for neural 
differentiation*'~*. Rosettes were collected on day 8 and cultured further to give 
clusters of proliferating NPCs. nestin*/TUJ1~ NSCs were maintained in N2 med- 
ium supplemented with epidermal growth factor (EGF) (20 ng ml ~ ) and fibroblast 
growth factor 2 (FGF2) (20 ng ml ~ ") (R&D Systems) and B-27 supplement (1:100, 
Invitrogen) and sorted for EGF-receptor (EGFR) expression. Nestin’ /TUJ1~* 
neurons were obtained by eliminating EGER* NSCs and CD44* non-neural cells 
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f, GFP expression, in neurons from one C* iPSC line, in UNC93B1 oa neurons, 
without lentiviral infection, or after infection with a lentivirus containing human 
wild-type UNC93B1 (UNC93B1 '~ + UNC93B1) or a mock construct 
(UNC93B1 ‘~ + mock). g, GFP expression in oligodendrocytes differentiated 
from UNC93B1~'~ iPSCs, from a C* hESC line and a C* iPSC line. Error 
bars, s.e.m., calculated from two to three experiments, from the C* hESC lines, 
the C* iPSC lines and the UNC93B1~‘~ lines, each studied in triplicate (b- 

d, e and g); or from one experiment carried out in triplicate, representative of two 
independent experiments (f). ANOVA was performed for the HSV-1-GFP levels 
shown in b to g. When significant, Dunnett’s t-tests were performed for two-by- 
two comparisons. *P < 0.05, **P < 0.01, ***P < 0.001 (comparisons between 
each control line and each patient line). 


from NPCs. For the generation of astrocytes, NPCs were allowed to proliferate in 
the presence of EGF and FGF2 for 40 to 60 days and were then cultured in medium 
containing 5% FBS for an additional 15 to 20 days. Immature oligodendrocytes 
were obtained from NPCs following treatment with SonicC25II (125 ng ml), 
FGF8 (100 ng ml |; R&D Systems), brain-derived neurotrophic factor (BDNF) 
(20 ng ml!) and ascorbic acid (0.2 mM) for 50 to 70 days and sorted for O4. 

TLR3 agonists and viral stimulation. A synthetic analogue of dsRNA, (polyino- 
sinic:polycytidylic acid (poly(I:C)); Amersham), a TLR3 agonist, was used at a con- 
centration of 25 ug ml 1 for 2, 4 or 6h of stimulation. For HSV-1 infection, we used 
the KOS-1 strain of HSV-1 anda GFP-expressing HSV-1 (HSV-1-GFP”*), at various 
multiplicities of infection (MOIs), to infect CNS cells. The induction of IFN-B, IFN-A 
and IFN-responsive genes was assessed in these cells, by quantifying mRNA by 
quantitative polymerase chain reaction with reverse transcription (RT-qPCR). 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 

Cell culture. Fibroblasts obtained from an UNC-93B-deficient patient, a TLR3- 
deficient patient and a healthy control were maintained in DMEM medium sup- 
plemented with 10% fetal calf serum (FCS), 2mM .-glutamine, penicillin 
(50U ml!) and streptomycin (50 Lg ml '). The patients resided in France, where 
they were followed up and where informed consent was obtained, in accordance with 
local regulations, with institutional review board (IRB) approval. The experiments 
described here were conducted in the United States, in accordance with local 
regulations and with the approval of the IRB of the Rockefeller University, the 
Harvard Medical School and the Sloan-Kettering Institute for Cancer Research. 
Induced pluripotent stem cells (iPSCs) and human embryonic stem cells (hESCs; 
line H9 (WA-09, XX, P40-55) were maintained on CF1-irradiated MEFs (iMEFs, 
Globalstem) in hESC medium consisting of DMEM with Ham’s F12 (Invitrogen) 
supplemented with 20% Knockout Serum (KOSR, Invitrogen), 10 ng ml ! basic 
fibroblast growth factor (bFGF, Gemini Bio-Products), 1mM L-glutamine 
(Invitrogen), 100 1M non-essential amino acids, 100 14M 2-f-mercaptoethanol 
(Sigma-Aldrich), penicillin (50 Uml ') and streptomycin (50 ug ml’). iPSCs 
were stimulated to differentiate into embryoid bodies by culture in bFGF-free 
hESC medium, without co-culture with feeder cells, on non-adherent Petri dishes, 
as described previously*'. 

Neural induction and neural subtype specification. MS5 stromal cells were 
grown in MEM medium supplemented with 10% FBS and 2 mM 1-glutamine”. 
The neural differentiation of hESCs and iPSCs was induced as described prev- 
iously~’, but in the presence of Noggin (R&D Systems) and $B431542 (Stemgent) 
from days3 to 10 of differentiation (dual-SMAD inhibition’’), to increase the 
efficiency of rosette formation. Rosettes were collected mechanically, starting at 
day 8 of differentiation, and were re-plated under feeder-free conditions on dishes 
coated with 10 pg ml | polyornithine (Sigma), 2 1g ml ? laminin (Cultrex) and 
1pugml' fibronectin (Fisher), in N2 medium supplemented with Sonic C25 II 
(20 mg ml |; R&D Systems), ascorbic acid (0.2 mM; Sigma) and BDNF (20 ng 
ml '; R&D Systems). Rosettes were allowed to proliferate for a further 5 days and 
were then re-plated, dissociated in Ca?- and Mg?-free Hank’s balanced salt solu- 
tion (HBSS) and re-plated again. Emerging clusters of NPCs were collected for 
further proliferation or neural subtype specification. For the generation of neural 
stem cells and neurons, NPCs were maintained in N2 medium supplemented with 
EGF (20ng ml!) and FGE2 (20 ng ml !; R&D Systems) and B-27 supplement 
without vitamin A (1:100; Invitrogen). For the generation of astrocytes, NPCs were 
allowed to proliferate and were passaged in the presence of EGF and FGF? for 40 to 
60 days and then exposed to N2 medium containing 5% FBS for an additional 15 to 
20 days. For the generation of oligodendrocytes, emerging clusters of NPCs were 
cultured in N2 medium supplemented with Sonic C25II (125ngml~'), FGF8 
(100 ng ml '; R&D Systems), BDNF (20 ng ml ') and ascorbic acid (0.2 mM) 
for 50 to 70 days. 

Plasmids and vectors. The polycistronic lentiviral pHAGE-STEMCCA-LoxP 
vector, carrying the OCT4, SOX2, KLF4 and C-MYC reprogramming factor genes, 
has been described elsewhere**. Human UNC93B1 complementary DNA was 
amplified from existing cDNA sequences with the following primers: forward 
5'-ATAATATGGCCACACATATGGAGGCGGAGCCG-3’ and reverse 5’- 
GTTGATTAGGATCTATCGTCACTGCTCCTCCGG-3’. The amplified prod- 
uct was inserted downstream from the internal ribosome entry site (IRES) in a 
pHAGE2-EFla-DsRedExpress-IRES-W lentiviral vector (available from the 
Mostoslavsky laboratory), with the In-Fusion Advantage PCR cloning Kit 
(Clontech). 

Lentivirus production. Lentiviruses containing STEMCCA or pHAGE2-EFla- 
DsRedExpress-IRES-UNC93B1 were produced with a five-plasmid transfection 
system, in 293T packaging cells, by a slightly modified version of a method 
described previously”. In brief, 293T cells were transfected with STEMCCA or 
pHAGE2-EFla-DsRedExpress-IRES-UNC93B1 and four plasmids encoding the 
packaging proteins Gag-Pol, Rev, Tat and the G protein of the vesicular stomatitis 
virus (VSVG), in the presence of the Trans-IT 293 transfection reagent (Mirus). 
Viral supernatants were collected every 12h, on 2 consecutive days, starting 48h 
after transfection, and viral particles were concentrated by ultracentrifugation at 
49,000g for 1.5h at 4°C. 

Lentiviral infection and human iPSC generation. We infected 10° fibroblasts 
derived from patients or controls with the concentrated polycistronic STEMCCA 
lentiviral vector and then cultured them at 37 °C, under an atmosphere containing 
5% CO,, in 2ml of hFib medium supplemented with 5 1gml | protamine sul- 
phate, for 24h. One day after infection, the viral supernatant was removed and the 
cells were cultured for 72h in hFib medium. They were then transferred onto 
iMEFs and the medium was replaced with hESC medium. iPSC colonies with an 
ESC-like morphology were mechanically isolated 4 to 5 weeks after infection. 
Immunostaining. Cells were fixed by incubation in 4% paraformaldehyde for 
30 min and permeabilized by incubation with 0.2% Triton X-100 for 30 min. Cells 


were stained in blocking buffer (3% BSA; 5% goat serum) with primary (or con- 
jugated) antibodies at 4°C overnight, washed and stained with secondary 
antibodies and 1pgml~' Hoechst 33342 in blocking buffer for 3h at 4°C, in 
the dark. Primary OCT4 and NANOG antibodies (Abcam) were used at a con- 
centration of 0.5 jig ml‘, and an Alexa Fluor 555-conjugated anti-rabbit IgG 555 
(Invitrogen) was used as the secondary antibody (1:2000). The following conju- 
gated antibodies—TRA-1-60-Alexa Fluor 647, TRA-1-81-Alexa Fluor 488, SSEA- 
4-Alexa Fluor 647, and SSEA-3-Alexa 488 (Millipore)—were used at a dilution of 
1:100. FoxG1 antibody was a gift from S. A. Anderson. Nestin antibody was 
obtained from Neuromics, TUJlantibody from Covance, 04 from Millipore, 
GFAP from Dako, PLZF from Calbiochem, and ZO1 from Zymed. Images were 
acquired with a Pathway 435 bioimager equipped with a X10 objective (BD 
Biosciences). 

Whole-exome sequencing and analysis. DNA (3 ,1g) was extracted from cells and 
sheared with a Covaris $2 Ultrasonicator (Covaris). An adaptor-ligated library was 
prepared with the TruSeq DNA Sample Prep Kit (Illumina). Exome capture was 
carried out with the SureSelect Human All Exon 50 Mb kit (Agilent Technologies). 
Paired-end sequencing was performed on an Illumina HiSeq 2000 (Illumina), 
generating 100-base reads. The sequences were aligned with the human genome 
reference sequence (hg19 build) using the BWA aligner*’. Downstream processing 
was carried out with the Genome Analysis Toolkit (GATK)*”, SAM tools** and 
Picard Tools (http://picard.sourceforge.net). Variant calls were made with GATK 
UnifiedGenotyper. All calls with a Phred-scaled SNP quality of =20 were filtered 
out. GATK VariantEval was used to compare the call sets for fibroblasts and iPSCs. 
FACS-mediated cell purification. Cells were dissociated with Accutase (Inno- 
vative Cell Technologies) and subjected to FACS with 04 (1:300; Millipore), CD44 
FITC (2 ul per 10° cells; Abcam), and EGFR PE (10 pl per 10° cells; Abcam) 
antibodies, on a FACS Aria II machine (BD). 

Karyotype analysis. Karyotyping and G-banding were carried out blind, by Cell 
Line Genetics. 

Mutation analysis. Whole-genome DNA was isolated from fibroblasts and iPSCs 
with the QiAMP DNA Kit (Qiagen). Exon 8 of the UNC93B1 gene was amplified 
with the forward primer GCGTGGCTTTGTGCTGAGAG and the reverse primer 
CAGGAGGGGGATATTTGGGA. Reaction products were purified with the 
QlAquick PCR purification kit (Qiagen) and sequenced by the DF/HCC DNA 
Sequencing Facility. The results were analysed with Sequencher 4.8 software (Gene 
Codes Corporation). 

Microarray analysis. Total RNA was extracted with Trizol reagent (Invitrogen). 
RNA was collected from astrocytes and from CD44 /EGFR neurons differen- 
tiated from control hESCs or UNC-93B-deficient iPSCs. The RNA was then 
processed by the MSKCC Genomic core facility and hybridized with Illumina 
human HT-12 oligonucleotide arrays. Gene-expression analysis was carried out 
with the Partek Genomics Suite: following quantile normalization, all the genes 
displaying differential expression (FDR of 0.05, fold change of at least +2) with 
respect to hESC (total of 7,210 genes) in each population were visualized by 
clustering. Raw data for the microarray analyses performed in this study are 
available from the public repository of GEO Data Sets (accession number 
GSE40593). 

Electrophysiology. Whole-cell current clamp recordings were performed at room 
temperature (23-24 °C) in a Multiclamp 700B amplifier (Molecular Devices), as 
described previously”’*°. Neurons were identified under a Nikon microscope 
equipped with a <4 objective and a X40 water-immersion objective. Cells were 
continuously perfused with freshly prepared extracellular solution containing 
126mM NaCl, 26mM NaHCO;, 3.6mM KCl, 1.2mM NaH,PO,, 1.2mM 
MgCl, 2mM CaCl, and 17mM glucose, and the solution was saturated with 
95% Oy and 5% CO). The intracellular solution contained 135mM potassium 
gluconate, 5mM NaCl, 10mM HEPES, 0.5mM EGTA, 3mM potassium ATP, 
0.2 mM sodium guanosine triphosphate (GTP) and 10 mM sodium phosphocrea- 
tine. The pH was adjusted to 7.3 with KOH, and the osmolarity of the solution was 
approximately 290 milliosmoles (mOsm). Input resistance was calculated from the 
voltage response elicited by the intracellular injection of a depolarizing (+10- or 
+20-pA) current pulse. Current steps were applied for 1s to evoke action poten- 
tials. Liquid junction potentials were calculated and corrected off-line. Data were 
analysed with Clampfit (Molecular Devices) and SigmaPlot 11 (Systat Software) 
and are presented as means + s.e.m. 

Quantitative PCR with reverse transcription. For analysis of the expression 
profiles of key genes involved in stem cell properties and pluripotency, total 
RNA was extracted with the mirVana RNA isolation kit (Ambion). We reverse- 
transcribed 100 ng of total RNA to generate cDNA, in qScript cDNA Supermix 
(Quanta). RT-qPCR was then performed in an AB 7500 Real-Time PCR system 
(Applied Biosystems), with the PowerSYBR Green PCR Master Mix (Applied 
Biosystems). The results were analysed with SDSv1 Software and normalized with 
respect to B-actin gene expression. Expression levels were determined by relative 
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quantification using the comparative Ct (ddC;) method and are expressed relative 
to those in the individual parental cell lines. The primer sequences used have been 
described elsewhere’. We assessed the expression levels of genes of the TLR3 and 
IFN pathways and of the genes for IFN-B, IFN-A, NF-«B and MX1, by extracting 
total RNA from NSCs, neurons, oligodendrocytes and astrocytes. RNA was 
reverse-transcribed directly, with oligo(dT), to determine mRNA levels for 
TLR3- and IFN-pathway genes and for IFN-B, IFN-A, NF-KB and MX1. RT- 
qPCR was performed with Applied Biosystems Assays-on-Demand probe and 
primer combinations and universal reaction mixture, in an ABI 7500 Fast Real- 
Time PCR System (Applied Biosystems). The B-glucuronidase complex (GUS) 
gene was used for normalization. Results are expressed according to the AACt 
method, as described by the manufacturer. 

TLR3 agonists and viral stimulation. We used a synthetic analogue of dsRNA, 
(polyinosinic:polycytidylic acid (poly(I:C)); Amersham), a TLR3 agonist, at a 
concentration of 25 pg ml *. After incubation with or without poly(I:C) for 2, 4 
or 6h, cells were collected in Trizol for RNA extraction. For HSV-1 infection, we 
used the KOS-1 strain, at a multiplicity of infection (MOI) of 1. A GFP-expressing 
HSV-1 (HSV-1-—GFP?*) was used at various MOIs to infect NSCs, neurons, oli- 
godendrocytes and astrocytes. 

Cytokine determinations. The production of IFN-«, IFN-B, IFN-A and IL-6 was 
assessed by enzyme-linked immunosorbent assay (ELISA). Separate ELISAs were 
carried out for each of IFN-« (AbCys SA), IFN-B (TFB, Fujirebio) and IL-6 
(Sanquin), according to the kit manufacturers’ instructions. The IFN-A ELISA 
was developed in the laboratory, as described previously”. 

HSV-1-GFP infection and quantification. For HSV-1-GFP infection, 10* 
NSCs, neurons, oligodendrocytes or astrocytes were plated in individual wells of 
96-well plates and infected with HSV-1-GFP, at various MOI, in a medium 
appropriate for the cell type concerned. Cells were incubated for 2 h, then washed 
and incubated in 100 pil of culture medium. HSV-1-GFP titres were determined by 
measuring the GFP fluorescence density. The GFP fluorescence of the samples was 
quantified at the 2, 8, 18 and 24h after the start of experimentation. For assays of 
cell protection after viral stimulation, cells were treated with IFN-«2B (10* inter- 
national units (IU) ml’; Schering-Plough), IFN-B (10*IUml '; PBI 
Interferonsource), IFN-A1 (2.5 pg ml 7; R&D Systems), poly(I:C) (25 ug ml '), 
CpG-A (Spgml~’; Dynavax Technologies) or CpG-C (Spgml ', Dynavax 
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Technologies) for 36h before infection for neurons, oligodendrocytes, NSCs 
and astrocytes, as appropriate. 

Statistical tests. Mean values of IFN induction levels or HSV-1-GFP levels, from 
control cells and patients’ cells, were compared using ANOVA and/or Student’s 
t-tests, as implemented in the procedures PROC TTEST and PROC ANOVA of 
SAS version 9.1 (SAS institute). ANOVA was carried out to compare the means of 
more than two groups. When significant (P< 0.05), ANOVA was followed by 
t tests for pairwise comparisons (in particular, Dunnett’s t tests comparing the 
patient with each of the controls). 
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The mammalian host has developed a long-standing symbiotic re- 
lationship with a considerable number of microbial species. These 
include the microbiota on environmental surfaces, such as the 
respiratory and gastrointestinal tracts’, and also endogenous re- 
troviruses (ERVs), comprising a substantial fraction of the mam- 
malian genome’. The long-term consequences for the host of 
interactions with these microbial species can range from mutu- 
alism to parasitism and are not always completely understood. 
The potential effect of one microbial symbiont on another is even 
less clear. Here we study the control of ERVs in the commonly used 
C57BL/6 (B6) mouse strain, which lacks endogenous murine leuk- 
aemia viruses (MLVs) able to replicate in murine cells. We demon- 
strate the spontaneous emergence of fully infectious ecotropic* 
MLV in B6 mice with a range of distinct immune deficiencies 
affecting antibody production. These recombinant retroviruses 
establish infection of immunodeficient mouse colonies, and ulti- 
mately result in retrovirus-induced lymphomas. Notably, ERV 
activation in immunodeficient mice is prevented in husbandry 
conditions associated with reduced or absent intestinal micro- 
biota. Our results shed light onto a previously unappreciated role 
for immunity in the control of ERVs and provide a potential mech- 
anistic link between immune activation by microbial triggers and a 
range of pathologies associated with ERVs, including cancer. 

Retroviruses can establish germline infection and become part of 
the host genome”’. Most, if not all, ERVs have become inactive owing 
to mutations, or transcriptionally silenced through the action of diverse 
mechanisms~’. However, RNA and protein expression of replication- 
defective ERVs are frequently increased in infection, autoimmunity and 
cancer”. Whether or not the immune system defends against potential 
threats posed by ERVs is unclear. To address the role of adaptive im- 
munity in this process, we assessed ERV expression in B6 mice. We 
initially compared the transcriptional profiles of purified macrophages 
from B6 wild-type and T- and B-cell-deficient Rag!~‘~ mice. The two 
transcripts with the highest increase in expression levels in macro- 
phages from Rag] ‘~ mice (Fig. 1a) correspond to the env and gag 
genes, respectively (Supplementary Table 1), of an endogenous ecotro- 
pic MLV (eMLV) locus, Emv2, a replication-defective single-copy ERV 
present in B6 mice’. Differential expression of eMLV was confirmed by 
quantitative reverse transcriptase PCR (qRT-PCR) for spliced env mes- 
senger RNA in macrophages (Fig. 1b) and in several tissues (Fig. 1c). 

Other ERV families were also differentially expressed in macro- 
phages, albeit less strongly (1.7-2.1-fold; Supplementary Table 1). 
Not distinguishing between members of multicopy families, expres- 
sion of polytropic MLVs (pMLVs), xenotropic MLVs (xMLVs) and 
the Mus musculus type D (MusD) retrovirus family of retrotranspo- 
sons was also increased in the lungs of Rag] ‘~ mice (Supplementary 
Fig. 1). In line with increased MLV mRNA expression, 60-80% of total 
splenocytes and all haematopoietic lineages analysed from Rag] '~ 
but not wild-type mice, expressed MLV surface glycoprotein (SU) 
(Fig. 1d, e). 


Tcra'~ or Tcrd~'~ mice, lacking T-cell receptor (TCR)aB and 
TCRy6 T cells, respectively, showed low eMLV expression (Fig. 1f). 
eMLYV expression was similarly low in H2-A,E '~ mice (MGI allele 
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Figure 1 | eMLV activation in antibody-deficient mice. a, Significantly 
upregulated (>4-fold) genes in CD11b* MHC-II"B220~ Grl~ macrophages 
from Rag!‘ mice compared with macrophages from wild-type (WT) mice. 
Triplicate microarrays from cells isolated from 40 mice are shown. b, eMLV 
spliced env mRNA expression in the same cells as in a. Each symbol represents 
macrophages from 20 mice (P = 0.024; paired Student’s t-test). c, eMLV spliced 
env mRNA expression in indicated organs from wild-type or Rag] ‘~ mice 
(spleen: P = 0.020; ileum: P = 0.032; colon: P = 0.004; lung: P = 0.001; muscle: 
P= 0.016; and kidney: P = 0.009; unpaired Student’s t-test). d, e, MLV SU 
expression (detected using the 83A25 monoclonal antibody; see Methods) in 
splenocytes (d) or indicated cell types (e) from wild-type or Rag] /~ mice. 

f, eMLV spliced env mRNA expression in the spleens of the indicated strains 
(P< 0.001 between wild-type and either Ighm ‘~ or Ighm '~ MD4 mice; one- 
way analysis of variance (ANOVA)). g, MLV SU expression in splenic 
lymphocytes from wild-type or Ighm ‘~ MD4 mice. h, eMLV spliced env 
mRNA expression in the spleens of the indicated strains (P < 0.001 between 
wild-type and either Myd88 ‘~ or Tlr7-‘~ mice; one-way ANOVA). Inc, fand 
h, each symbol is an individual mouse. In d, e and g, plots are representative of 
four mice per group. In f and h, values above 10° were considered high and are 
indicated by red-filled symbols. 
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H244el-Fa. Jacking the region between the H2-Ab1 and H2-Ea genes), 
deficient in major histocompatibility complex (MHC) class II, MHC 
II-restricted T cells and T-cell-dependent antibodies (Fig. 1f). By con- 
trast, mice lacking B cells (Ighm™'~ mice) or mice unable to produce 
polyclonal antibodies (Ighm '~ MD4 mice) expressed substantially 
higher eMLV levels than did wild-type mice (Fig. 1f), demonstrating 
that high eMLV expression characterized mice lacking Bingen Spee 
cific antibodies. In addition to splenocytes from Ragl " mice 
(Fig. le), T and B cells from Ighm " MD4 mice, but not from wild- 
type control mice, expressed MLV SU (Fig. 1g), indicating that eMLV 
can also be highly expressed in lymphocytes. 

Toll-like receptors (TLRs) have been implicated in the control of 
B-cell responses, and Myd88~'~ and Tir7~'~ mice have significantly 
reduced serum levels of natural antibodies and a defective antibody 
response to immunization or infection, including with retroviruses®°. 
Notably, the expression of eMLV was markedly increased in 
Myd88-'~ and Tir7~‘~ mice in comparison with wild-type mice 
(Fig. 1h). Similarly increased eMLV expression was observed in 
Tlr7’~ mice, but not in Tir9~/~ mice housed in a different facility 
(Supplementary Fig. 2). 

To investigate the mechanistic link between antibody deficiencies 
and increased MLV expression, we examined the origin of eMLV 
transcription. The B6 genome does not contain replication-competent 
eMLV proviruses, and, although the Emv2 locus can produce mRNA, 
it is unable to produce infectious virus owing to an inactivating G-to-C 
mutation at position 3576 of the pol region®"®. In addition, Emv2 gag 
encodes an N-tropic capsid, which would be restricted by the Fv1? 
restriction factor in B6 mice’’. However, it was theoretically possible 
that recombination between replication-defective Emv2 and non- 
ecotropic MLVs resulted in an MLV with full infectivity'' that could 
spread in Rag! ‘~ mice. Remarkably, the plasmas of young and old 
Rag ' ~ mice, but not of wild-type control mice, contained retrovi- 
ruses that were capable of replicating in mouse cells in vitro (Fig. 2a), 
which we refer to as Rag1 ~'~ mouse-associated retroviruses (RARVS). 
Sequencing of the pol region demonstrated repair of the Emv2- 
inactivating mutation in all RARV isolates (Supplementary Fig. 3). 
Functional in vitro assays (Fig. 2b) and sequencing of the gag region 
(Fig. 2c) showed that RARVs also exhibited B-tropism. Genome 
sequence comparisons between RARVs showed that young Rag] /~ 
mice contained highly similar viruses, which diverged substantially in 
old Ragl ' ~ mice (Fig. 2d). All RARVs were recombinants between 
Emv2 and endogenous non-ecotropic MLVs (Supplementary Fig. 4). 
The pol defect of Emv2 was probably restored in RARVs by recom- 
bination with Xmv43 (also known as Bxv1; Supplementary Fig. 4), an 
ERV that contains a functional pol region but is unable to infect mouse 
cells owing to polymorphisms in the mouse cellular receptor’. 
Recombination events involving Xmv43 have also been found to be 
responsible for the emergence of leukaemogenic MLVs in AKR mice”. 
However, the switch in capsid tropism resulted from recombination 
with other endogenous xMLVs (Supplementary Fig. 4). Notably, the 
divergence of RARVs isolated from old Ragl ‘~ mice was due to 
further recombination replacing the ecotropic env with polytropic 
env from Pmv1, Pmv5 or Pmv16 (Supplementary Fig. 4). Together, 
these findings indicate the emergence of infectious eMLVs that could 
have infected Rag]’~ mice and given further rise to infectious 
pMLVs. Supporting this notion, most gag/pol eMLV mRNA detected 
in Ragl~'~ mice seemed to be transcribed from integrated RARVs, 
rather than the germline copy of Emv2 (Supplementary Fig. 5). 

Emv2 and non-ecotropic MLV recombination events resulting in 
infectious eMLV generation might occur de novo in individual 
Ragl '~ mice. Alternatively, these RARVs might have been vertically 
transmitted through successive generations. To test the latter possibi- 
lity directly and assess the capacity of RARVs to spread in immuno- 
deficient mice, we established a colony of Ragl~'~ Emv2~'~ mice by 
first crossing an Emv2 ‘~ male mouse" witha Ragl‘~ female mouse, 
and then intercrossing selected progeny to homozygosity. These mice 
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lack the germline copy of Emv2, meaning any infectious eMLV present 
would have been vertically transmitted. Both eMLV spliced env mRNA 
(Fig. 2e) and MLV SU expression (Fig. 2f) were readily detected in the 
spleens of Ragl~'~ Emv2~‘~ mice in this colony. Furthermore, ana- 
lysis of eEMLV env and pol DNA copies indicated extensive replication 
of vertically transmitted RARVs in Rag] ‘~ Emv2~'~ mice (Fig. 2g). 
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Figure 2 | Retroviraemia and leukaemias/lymphomas in antibody-deficient 
mice. a, Detection of infectious MLV (RARV-5/XG7) from the plasma of a 
representative Rag] ‘~ mouse by restoring infectivity of the green fluorescent 
protein (GFP)-expressing XG7 retroviral vector in the indicated cell type. 
Numbers within the plots denote the percentage of retrovirally transduced 
(GEP*) cells. b, Fvl tropism of RARVs isolated from 6 (R2-R4)- or 25 
(R5-R8)-week-old healthy Rag] ‘~ mice, shown as the ratio of infectivity in 
B-3T3 to N-3T3 cells (B:N ratio). B- and N-tropic strains of Friend MLV 
(F-MLV) are shown for comparison. c, Amino acid residues of capsid positions 
105-113 (CA105-113) deduced from the nucleotide sequence of Emv2 and the 
same RARVs as in b. Dots indicate identities. d, Phylogenetic tree of the same 
RARVsas in b. The scale indicates the probability of base substitution per site. 
e, eMLV spliced env mRNA expression in the spleens of Rag] /~ mice or 
vertically infected Rag] '~ Emv2~'~ mice. Each dot is an individual mouse 
(P< 0.001; unpaired Student’s t-test). Values above 10° were considered high 
and are indicated by red-filled symbols. f, MLV SU expression in splenocytes 
from Rag] ‘~ or vertically infected Rag] ‘~ Emv2~'~ mice (representative of 
nine mice per group). g, eMLV DNA copy numbers per haploid genome, 
determined by qPCR for the pol or ecotropic env gene, in DNA from the spleens 
of healthy Rag] ‘~ mice (right) or vertically infected Ragl”'~ Emv2~'~ mice 
(left). Symbols represent individual mice, grouped according to their age. The 
sensitivity limit of this PCR method was determined as a median of 0.0003 
copies per haploid genome, using Emv2 '~ mice. eMLV DNA copy numbers 
for Ragl ~~ mice include Emv2 (1/N). h, Tumour (leukaemias/lymphomas) 
incidence in cohorts of wild-type (n = 37), Ragl ~~ (n= 38) or vertically 
infected Rag!” '~ Emv2~'~ mice (n = 23) at the NIMR SPF facility 
(P<0.000001 between wild-type and Rag! '~ mice; P = 0.00025 between 
wild-type and Rag] '~ Emv2 '~ mice; log-rank survival analysis). 
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By contrast, sexual or in utero infection was not observed in separate 
crosses of either male or female virus-positive Rag” '~ Emv2~'~ mice 
with virus-free Emv2 '~ mice (Supplementary Fig. 6). 

To examine the potential of RARVs to replicate in Rag]~'~ mice 
further, we assessed the frequency of tumours characteristic of retro- 
viral infection’® in cohorts of Ragl~'~ mice. Notably, starting from 
180 days and affecting 67% of the animals by 380 days, Rag1~'~ mice, 
but not wild-type control mice, showed signs of morbidity (Fig. 2h). 
On examination, large tumours, often associated with anaemia, were 
observed in all morbid Ragi~‘~ mice (Fig. 2h and Supplementary 
Fig. 7). The pathogenic potential of infection with RARVs was estab- 
lished in aged cohorts of vertically infected Rag] ‘~ Emv2~'~ mice, 
which developed tumours at a comparable incidence rate (Fig. 2h). 

Thymic or splenic tumours in Rag] ‘~ mice consisted mainly of a 
single MLV SU-expressing cell type, which differed between animals, 
and had the histological appearance of lymphoblastic lymphosarco- 
mas (Supplementary Fig. 8). Discrete chromosomal aberrations were 
found in most tumours analysed (Supplementary Fig. 8), suggestive of 
clonal origin. Consistent with MLV production by tumour cells, we 
observed an abundance of MLV-type particles in the extracellular 
space of tumour samples, but not in a spleen sample from a healthy 
Ragl ‘~ mouse (Supplementary Fig. 9). Furthermore, a substantial 
increase in both eMLV env and pol DNA copy numbers was detected 
in all tumour samples, with one exception in which only pol DNA 
copies were increased (Supplementary Fig. 9), indicating that RARVs 
had extensively infected the cells that gave rise to lymphomas. 
Together, these results support a model in which several recombina- 
tion events restore Emv2 infectivity, leading to spontaneous retrovir- 
aemia and vertical transmission to progeny, and eventually drive an 
oncogenic process similar to that extensively described in mouse 
strains carrying fully infectious ERVs””. 

Our results associated a lack of antibodies with establishment of infec- 
tious eMLVs in mouse colonies. Next, we investigated the potential mode 
of antibody action. Antiretroviral antibodies have a long-established role 
in limiting the spread of infectious endogenous retroviruses, both 
within and between animals. However, it was also possible that antibodies 
were preventing a step before the emergence of infectious eMLV re- 
combinants. Rescue of Emv2 infectivity by recombination with a non- 
ecotropic MLV necessitates co-expression of both proviruses in the same 
cell at sufficient levels for co-packaging into the same virion. Low expres- 
sion of these proviruses in wild-type mice could be a rate-limiting factor 
in the emergence of infectious eMLVs. However, expression of certain 
endogenous MLVs in mouse cells is known to be inducible, notably, by 
microbial products**"*. For example, bacterial lipopolysaccharide (LPS) 
stimulation activates non-ecotropic MLVs, and Xmv43 in particular’>”. 
To examine the responsiveness of ERVs and other retroelements to mi- 
crobial stimulation, we took advantage of probes or probe sets in standard 
microarray platforms that report ERV or retroelement expression. 
Analysis of a publicly available data set”? uncovered specific induction 
of non-ecotropic MLV transcripts by LPS and polyinosinic:polycytidylic 
acid (poly(I:C)), and suppression by Pam3CSK, (Fig. 3a, b). Early trans- 
poson transcripts were also induced by poly(I:C) (Fig. 3a). These findings 
were confirmed by LPS stimulation, which induced high MLV SU ex- 
pression in wild-type and Emv2-‘~ mouse splenocytes, also in a MyD88- 
independent manner (Fig. 3c and Supplementary Fig. 10). These in vitro 
conditions did not significantly induce eMLV expression (Fig. 3a). How- 
ever, ex vivo analysis showed higher expression of eMLV (Fig. 1c), as well 
as of other ERVs/retroelements (Supplementary Fig. 1), specifically in the 
colon, suggestive of microbial involvement. 

Antibodies have established roles in controlling intestinal bacteria 
and neutralizing their products, such as LPS, in the gut lumen or the 
systemic circulation, and antibody-deficient mice are known to display 
increased microbial trasnslocation'’”*'~’. It was, therefore, possible 
that antibody deficiency allowed microbial products to induce expres- 
sion of MLV proviruses in Ragl~'~ mice, including the parents of 
recombinant RARVs (Supplementary Fig. 1). In support of this notion, 
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Figure 3 | Mouse ERV activation by microbial products. a, ERV/ 
retroelement-reporting probe set (Supplementary Table 3) signals in a publicly 
available Affymetrix HT mouse genome 430A microarray data set”? 
(ArrayExpress accession E-GEOD-17721) of wild-type B6 bone marrow- 
derived dendritic cells after stimulation with microbial products. Black arrows 
indicate the probe sets that are significantly regulated (P < 0.05) more than 
twofold by at least one stimulus. Emv2-specific probe sets (annotated as Mela) 
are also indicated by grey arrows for comparison. ETn, early transposon. 

b, Mean log, fold change in the MLV-reporting probe set in the same data set. 
c, MLV SU expression in wild-type or Emv2 ' splenocytes before (open 
histograms) or after (filled histograms) stimulation with 10 1g ml~* LPS for 
48 h, according to forward scatter (Fsc) and CD19 expression. Numbers in the 
plots denote the percentage of cells within each gate and represent two donors 
each analysed in duplicate. 


and in agreement with the established role of natural IgM in systemic 
clearance of bacterial LPS and protection from endotoxaemia’*’”, 
production of non-hypermutated IgM alone was sufficient for 
eMLV control (Supplementary Fig. 11). If antibodies required for 
preventing MLV expression were, indeed, against several microbial 
products, then antibody deficiency should not result in increased 
MLV expression in the absence of microbial triggers (Supplemen- 
tary Fig. 12). 

To begin to examine the contribution of microbial triggers, we 
measured eMLV expression in specific pathogen-free (SPF) Rag] /~ 
mice from colonies that differed in intestinal microbiota. Importantly, 
the use of embryo transfer for the rederivation of these independent 
colonies removes adventitious organisms, including vertically trans- 
mitted eMLVs. Therefore, any RARVs found in these rederived 
Ragl ‘~ mouse colonies must be generated de novo in the life-history 
of each colony. In stark contrast to Rag! '~ mice that were maintained 
on neutral pH water at the National Institute for Medical Research 
(NIMR), colonies of Ragl '~ mice that were maintained on acidified 
water expressed minimal eMLV levels (Fig. 4a). Water acidification 
reduced overall bacterial diversity in the colons of Ragl~‘~ mice 
(Supplementary Fig. 13 and Supplementary Table 2) and is a common 
precautionary measure used in many animal facilities that reduces 
bacterial colonization within the intestinal tract and translocation into 
the circulation’*. Lack of e€MLV expression was noted in Rag1~/~ mice 
obtained from the Jackson Laboratory (JAX), also maintained on 
acidified water (Fig. 4a). Furthermore, minimal levels of eEMLV were 
detected in Rag] mice on neutral pH water at the Rodent Center 
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Figure 4 | eMLV activation in antibody-deficiency depends on husbandry 
conditions. a, eMLV spliced env mRNA expression in the spleens of Rag] '~ 
mice on neutral pH (SPF) or acidified water (pH 2.5) at the NIMR, on acidified 
water (pH 2.8) at JAX, on neutral pH at RCHCLI, or in germ-free (GF) facilities 
at UMICH (P< 0.016 between Ragl ‘~ mice at SPF NIMR and all other 
groups; one-way ANOVA). b, eMLV spliced env mRNA expression in the 
spleens of Ighm '~, Myd88-'~ or TIr7 ‘~ mice on neutral pH water (SPF) at 
the NIMR or on acidified water (pH 2.8) at JAX (P = 0.005 and P = 0.029 for 
Ighm! ~ and TIr7 /~ mice, respectively; unpaired Student’s t-test). Each dot is 
an individual mouse and values above 10° were considered high and are 
indicated by red-filled symbols. 


HCI (RCHCI; Fig. 4a), which contained distinct bacterial genera in 
comparison with Rag] /~ mice at NIMR (Supplementary Fig. 13 and 
Supplementary Table 2). Lastly, negligible levels of eMLV were 
expressed in Ragl'~ mice in germ-free facilities at the University 
of Michigan (UMICH) and offered neutral pH water (Fig. 4a). The 
latter two conditions also distinguished between effects of acidified 
water on intestinal flora and effects on other physiological processes. 
Thus, high eMLV expression in independently rederived colonies 
of Rag! '~ mice correlated with the presence of the normal SPF 
microbiota. 

Husbandry conditions contributed to high eMLV expression also in 
independently rederived strains with distinct immunodeficiencies. 
Colonies of Ighm~/~ and Tir7-‘~ mice maintained on acidified water 
at JAX expressed minimal levels of eMLV (Fig. 4b). Variable eMLV 
expression levels were detected in Myds8' ~ mice at JAX (Fig. 4b), 
probably because, once generated, vertical transmission of infectious 
eMLVs was unaffected by water acidification. Although defining the 
precise role of the microbiota on eMLV induction will require further 
investigation, collectively our results demonstrate that the high eMLV 
expression phenotype, in genotypes causing immunodeficiency, re- 
quires environmental interaction. 

High eMLV expression in independently rederived immunodefi- 
cient colonies reveals de novo eMLV induction in each colony. It does 
not, however, indicate when or how often in the life-history ofa colony 
infectious eMLVs may emerge. Adult RARV-free Rag! ‘~ mice, pre- 
viously on acidified water, maintained low eMLV expression after a 
switch to neutral pH water, indicating a low probability of RARV 
emergence and spread in an individual mouse. To examine whether 
this probability is low in general or could be higher during early mouse 
development, we monitored successive generations of Igim™'~ mouse 
colonies from two independent rederivations, one of which was recent, 
into the NIMR SPF facility. This analysis suggested that eMLV induc- 
tion occurred during the first few filial generations (F; Supplementary 
Fig. 14). Moreover, eMLV-expressing Myd88‘~ mice at JAX (Fig. 4b), 
were at F; of homozygous breeding. Thus, although low in individual 
mice, there is high cumulative probability of infectious eMLV emer- 
gence, involving a sequence of recombination events similar to those 
seen in Rag] ‘~ mice (Supplementary Fig. 12), and subsequent estab- 
lishment in an antibody-deficient colony over a few generations. 
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The B6 strain has historically dominated many research fields, 
partly owing to the resistance of this strain to retrovirally induced 
tumours. Our results demonstrate that this important attribute of B6 
mice is conditional on their immune competence, with significant 
implications both for the design and interpretation of mouse studies. 
Furthermore, our results show that ERV activation is determined by 
husbandry conditions, thus accentuating potential differences in ERV 
expression between animal facilities. 

Although well-established in the mouse”, the oncogenic potential 
of ERVs in humans has not been observed”*”*. However, non-long 
terminal repeat (LTR) retroelement families have been documented 
to have caused human cancers by insertional mutagenesis at the so- 
matic level’*”®. Interestingly, we found that TLR stimulation of human 
cells induced expression of distinct ERVs and retroelements, including 
the mammalian apparent LTR retrotransposon (MaLR) family 
(Supplementary Fig. 15), previously implicated in the pathogenesis 
of human lymphomas”. Transcription of human ERVs and retroele- 
ments can also be induced by physiological activation of both adaptive 
and innate immune cells”*. Moreover, increased risk of lymphomas 
in humans is linked to infection or inflammation” and also to anti- 
body deficiencies*’. Thus, interactions between microbial symbionts, 
leading to ERV/retroelement activation, may provide a mechanistic 
link between cancer and stimulation of the immune system by micro- 
biota or pathogenic infections. 


METHODS SUMMARY 

Mice. Inbred C57BL/6J (B6) or genetically modified B6-backcrossed mice were 
from the following facilities: NIMR (London, UK); JAX (Massachusetts, USA); 
UMICH (Michigan, USA); CIML (Marseille, France); London Research Institute, 
Cancer Research UK (London, UK); RCHCI (Ziirich, Switzerland); or Center for 
Cancer Research, National Cancer Institute (Maryland, USA). Mice were kept in 
individually ventilated cages in SPF facilities or in germ-free isolators. Mice were 
maintained on either neutral pH or acidified water. Where indicated, colonies were 
established by rederivation into each of these facilities. This process ensures the 
removal of adventitious agents and leads to colonization of the newly rederived mice 
with the specific microbiota of each facility (or the lack of colonization in the case of 
the germ-free facility). All animal experiments were approved by ethical committees 
of respective institutes, and conducted according to local guidelines and regulations. 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 

Mice. Inbred B6 and B6-backcrossed Ragl-deficient B6.129S7-Ragl'”"!"™°" 
(Ragl '~) mice, TCR «deficient B6.129P2-Tera”™™” (Tcra~'~) mice”, TCR 
5-deficient B6.129S7-Terd”™!°" (Tcrd~'~) mice**, MHC class II-deficient mice 
B6.12982-H2"4"1"4 (H12-A,E '~) mice*, B-cell-deficient, Ig heavy constant chain 
ji-targeted B6.129S2-Ighm'"'&" (Ighm‘~) mice*, hen-egg lysozyme (HEL)- 
specific B cell receptor-transgenic (Ighm~'~ MD4) mice**, MyD88-deficient 
B6.129P2-Mydss'"4k (Myds8 / ~) mice*’, toll-interleukin 1 receptor (TIR) 
domain-containing adaptor protein (TIRAP)-deficient B6.129P2-Tirap'”"“* 
(Tirap ') mice’, and toll-like receptor adaptor molecule 1 (TICAM-1)-deficient 
B6.129P2-Ticam1'""4*' (Ticam1~'~) mice*® have been described. B6-congenic 
mice lacking Emv2 (Emv2~‘~) were generated by 12 nuclear generations onto 
the B6 genetic background of the respective site on chromosome 8 from the A/J 
strain, which lacks the Emv2 integration, and have been previously described”. 
Emv2'~ mice were subsequently crossed with B6-backcrossed Ragl ~/~ mice to 
create Rag] /~ Emv2~'~ mice. These strains were maintained in individually 
ventilated cages (IVCs) in SPF facilities at the NIMR, and kept on ultraviolet- 
irradiated and filtered neutral pH water. In addition, separate colonies of Rag!‘ , 
Ighm, Myd88"'~, Tirap '~ and Tcam1 ‘~ mice were maintained in the same 
facilities and constantly kept on a diet of acidified water (pH 2.5). 

B6 and B6-backcrossed B6.129S7-Ragl paler yy (Ragl ~~) mice, B6.129S2- 
Ighmi!"/y (Ighm'~) mice, B6.12981-THr7"""/J_ (TIr7-'/~) mice", and 
B6.129P2-Mydsg'"t 1 IY (M. fyd83! ~) mice” were also maintained in SPF faci- 
lities at the Jackson Laboratory, and were kept on acidified water (pH 2.8-3.2). 

B6 and B6-backcrossed B6.129S7-Rag1”"!°" (Rag1'~) were also maintained 
in germ-free facilities at the UMICH, and kept on autoclaved distilled water. 

B6 and B6-backcrossed B6.129S1-Tr7"’""" (TIr7~/~) mice, and B6.129P2- 
Tr9"4" (Tir9'~) mice’? were also maintained in SPF facilities at the Centre 
d'Immunologie de Marseille-Luminy (CIML), and kept on autoclaved water. 

B6-backcrossed TLR7-deficient B6.129P2-TIr7”"!“* (Tir7~'~) mice" were also 
maintained in SPF facilities at the London Research Institute (LRI), Cancer 
Research UK, and kept on autoclaved water. These mice were received into the 
NIMR quarantine facility and bred before testing. 

B6 and B6-backcrossed B6.129S7-Ragl nnd Mom (Ragl ~!~) mice, B cell-deficient, 
Ig heavy chain joining region-targeted B6.129S7-Igh-J'"™"™ (Igh-J~/~) mice*’, 
IgA-deficient, Ig heavy constant « chain-targeted B6.129S7-Igha'™™™ 
(Igha'~) mice", and polymeric Ig receptor-deficient B6.129P2-Pigr'”"*" 
(Pigr ‘~) mice*’ were maintained in individually ventilated cages in SPF facilities 
at RCHCI and kept on autoclaved water. 

CS57BL/6NCri-backcrossed B6.129S2-Ighm"™'" (Ighm~'~) and activation- 
induced cytidine deaminase-deficient B6.CBA-Aicda’”™"#°" (Aicda’'~) mice**, 
were also maintained in SPF facilities at the Center for Cancer Research, National 
Cancer Institute, and kept on autoclaved water. 

All colonies of Ragl mice tested in this study (NIMR, JAX, UMICH and 
RCHCI) were established by rederivation of a common JAX stock into each of 
these facilities. This process ensures the removal of adventitious agents, including 
infectious eMLVs, and leads to colonization of the newly rederived mice with the 
specific microbiota of each facility (or the lack of colonization in the case of the 
germ-free facility). These independent colonies were maintained by homozygous 
breeding for at least 12 and up to 45 filial generations (F)2-F4s) before testing. 

Colonies of Ighm~'~ mice at NIMR and JAX were also established by rederiva- 
tion. The Ighm ‘~ mouse colony at JAX was at least the F,; of homozygous 
breeding before testing. Two colonies of Ighm‘~ mice, established at NIMR by 
two independent rederivations (as part of the unit reorganization), were tested at 
different points after rederivation. The first was maintained for 16 months (F4) on 
a diet of neutral pH water (before switching to acidified water for F,2), and the 
second, more recent colony was maintained on a diet of neutral pH water for only 
7 months (F,-F;) before testing. 

Colonies of Myd88 ‘~ mice at NIMR and JAX were also established by rederi- 
vation and maintained for approximately Fs and F; of homozygous breeding, 
respectively, before testing. Colonies of Tir7'~ mice at LRI/NIMR, JAX and 
CIML were also established by rederivation and maintained for approximately 
F,,, F; and F, of homozygous breeding, respectively, before testing. 

Unless otherwise indicated (for example, aged mice) all mice were used between 4 
and 8 weeks after birth. All animal experiments were approved by ethical committees 
of respective institutes, and conducted according to local guidelines and regulations. 
Expression analyses. For transcriptional analysis, peritoneal exudate cells were 
isolated from B6 or Rag ~/~ mice and were subsequently purified by cell sorting, 
performed on MofFlo cell sorters (Dako), as CD11b* MHC-II™ B220” Grl~ 
macrophages. RNA was isolated from the cells using TRI reagent (Sigma- 
Aldrich), according to manufacturer’s instructions. Purified RNA samples were 
checked for quality using the Agilent bioanalyzer (Agilent). Synthesis of cDNA, 
probe labelling and hybridization were performed using Affymetrix MouseGene 
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1.0 ST oligonucleotide arrays. Primary microarray data from triplicate arrays were 
analysed with GeneSpring GX (Agilent Technologies). 

ERV/retroelement transcription was quantified in various tissues by RT-PCR. In 
brief, organs were removed from mice and placed in TRI reagent or RNAlater 
(Invitrogen) at each animal facility. For mice bred outside the NIMR, tissues were 
then shipped to the NIMR for further processing. Total RNA was isolated with TRI 
reagent, precipitated with isopropanol and washed in 75% ethanol before being dis- 
solved in water. DNase digestion and clean-up was performed with the RNeasy mini 
kit (Qiagen) and cDNA produced with the High Capacity reverse transcription kit 
(Applied Biosystems) with an added RNase inhibitor (Promega Biosciences). A final 
clean-up was performed with the QlAquick PCR purification kit (Qiagen). RNase-free 
water (Qiagen) and later nuclease-free water (Qiagen) were used throughout the 
protocol. Purified cDNA was then used as a template for the amplification of target 
gene transcripts with SYBR Green PCR Master Mix (Applied Biosystems), run on the 
ABI Prism SDS 7000 and 7900HT (Applied Biosystems) cyclers, with the following 
primers (produced by Eurofins MWG Operon). eMLV spliced env cDNA (116- 
base pair (bp) product; primers previously described”): forward, 5'-CCAGGGA 
CCACCGACCCACCGT-3’; reverse, 5'-TAGTCGGTCCCGGTAGGCCTCG-3’. 
Mouse mammary tumour virus (MMTV) spliced env cDNA (116-bp product): for- 
ward, 5’-AGAGCGGAACGGACTCACCA-3’; reverse, 5'-TCAGTGAAAGGTC 
GGATGAA-3'. Hprt cDNA (92-bp product): forward, 5'-TTGTATACCTAAT 
CATTATGCCGAG-3’; reverse, 5’-CATCTCGAGCAAGTCTTTCA-3’. 

Primers specific for pMLV, modified pMLV, xMLV, MusD retrovirus, mouse 
retroviruses that use RNAS (GLN), murine endogenous retrovirus-like and 
intracisternal A-type particle elements have been previously described” *’. Data 
are plotted as expression of the target transcript, relative to expression of Hprt in 
the same sample, derived using the algorithm: 


Value = (Cr value of Hprt — Cy value of target) x 10°. 


A theoretical detection limit of two arbitrary units is also shown as dashed 
horizontal lines. For eMLV expression in particular, values above 10° were con- 
sidered as high and are indicated with red-filled symbols. 

eMLV mRNA transcripts originating from the Emv2 locus or from potentially 
integrated RARV proviruses were distinguished by RT-PCR on cDNA reverse- 
transcribed using a reverse primer specific to the ecotropic env gene (5'-TT 
CTGGACCACCACACGAC-3’) and amplified using unique transcript-specific 
forward primers in the pol region together with a common reverse primer in the 
same region. The 3’ nucleotides of the transcript-specific forward primers corre- 
spond to position 3576 of the pol gene, which is a C in Emv2, but a required G in all 
sequenced RARVs. The following primers (produced by Eurofins MWG Operon) 
were used. Emv2 specific mRNA: forward, 5’-CCTGGGTTTGCGGAAATGG 
CAC-3'. RARV specific mRNA, forward, 5’-CCTGGGTTTGCGGAAATGGC 
GG-3’; reverse, 5'-TTTGGCGTAGCCCTGCTTCTCG-3’. Both PCRs produce 
a 192-bp product. 

Microarray-based analyses of retroelement expression. Expression levels of 
ERVs and retroelements were determined in publicly available microarray data 
sets, by assigning ERV- or retroelement-reporting probes to the correct ERV or 
retroelement family they are reporting. Individual probe sequences were obtained 
for mouse and human microarray platforms from the vendors’ websites. 
Nucleotide sequence data for mouse and human retroelements (including ele- 
ments present in ancestral species) were downloaded from RepBase Update” 
(http://www. girinst.org/repbase; v25/11/2011) and further human retroelements 
from dbRIP database* (http://www.dbrip.org; v2h). Low-copy mouse ERV 
sequences were obtained from the literature****, database searches and mining 
of the C57BL/6] RefSeq assembly (v37) sequence. Local BLAST libraries were 
produced with the NCBI C++ Toolkit (http://blast.ncbi.nlm.nih.gov/) and a 
Python 3.2 (http://www.python.org/) script was used to run and query BLASTn 
(‘-task blastn-short’ optimized) for the obtained probe sequences. Three separate 
stringencies were used in the screening: 90% (>90% length, >90% nucleotide 
homology), 95% (>95% length, >95% nucleotide homology) and 100% (identical 
matching required). For microarray platforms in which single transcripts are 
represented by several probes, a further screen was imposed to ensure >75% of 
probes gave appropriate BLASTn hits. Under this threshold, probes were assumed 
to be divided across a retroelement-gene boundary and were excluded from fur- 
ther analysis. Probes were compiled from the 90% stringency analysis for human 
and mouse retroelements and from the 100% stringency analysis for low-copy 
C57BL/6] ERVs. These were used to produce a Python tool, REquest, which will be 
described in detail elsewhere, and the code of which is available on request, to mine 
retroelement expression from text-based microarray analysis output. 

Emv2/eMLV copy number analysis. DNA copy numbers of Emv2/eMLV were 
determined by qPCR on DNA samples isolated from the indicated organs or 
tumours using the following primers (Eurofins MWG Operon). Emv2/eMLV 


©2012 Macmillan Publishers Limited. All rights reserved 


LETTER 


env DNA (169-bp product; primers modified from published sequences”): for- 
ward, 5’-AGGCTGTTCCAGAGATTGTG-3’; reverse, 5’-TTCTGGACCACC 
ACACGAC-3'. Emv2/eMLV pol DNA (76-bp product; primers previously 
described*’): forward, 5’-CACTTTGAGGGATCAGGAGCC-3’; reverse, 5’-CT 
TCTAGGTTTAGGGTCAACACCTGT-3’. 

Signals were normalized for the amount of DNA used in the reactions based on 
amplification of the single-copy Ifnar1 gene with the following primers. Ifnar1 
DNA (150-bp product): forward, 5'-AAGATGTGCTGTTCCCTTCCTCTGCT 
CTGA-3’; reverse, 5'-ATTATTAAAAGAAAAGACGAGGCGAAGTGG-3’. 

Copy numbers were calculated with a AAC; method, using splenic DNA from 
B6 mice as control, which was assigned a value of 1 copy per haploid (N) genome. 
Sequencing and sequence analyses. Bacterial general diversity was determined in 
faecal samples from the colons of mice, by high-throughput sequencing of ampli- 
cons of bacterial DNA encoding 16S ribosomal RNA, using a the Roche FLX 
genome sequencer. Samples were collected at either NIMR or RCHCI, and were 
subsequently transported frozen to NIMR, where DNA was isolated, using the 
QIAamp DNA stool mini kit (Qiagen). DNA amplification, sequencing and meta- 
genomic analysis was performed by DNAVision. 

The region of the pol gene carrying the inactivating mutation in Emv2 was 
sequenced in isolated RARVs as described later. A 708-bp fragment spanning this 
region was amplified from genomic DNA isolated from Mus dunni cells infected 
with the respective RARV using the following primers (Eurofins MWG Operon). 
Emv2/eMLV pol DNA (708-bp product): forward, 5'-ATCGGGCCTCGGCCA 
AGAAAG-3’; reverse, 5’-CCGGGAGAGGGAGTAAGGTGGC-3’. RARV gen- 
omes were amplified from the same DNA template in two halves using the fol- 
lowing primers (Eurofins MWG Operon). RARV first half (4,074-bp product): 
forward, 5'-GCGCCAGTCCTCCGATAGACT-3’; reverse, 5'-CCGGGAGAG 
GGAGTAAGGTGGC-3’. RARV second half (4,909-bp product): forward, 5’- 
ATCGGGCCTCGGCCAAGAAAG-3’; reverse, 5'-TGCAACAGCAAAAGGCT 
TTATTGG-3’. 

PCR products were purified with the QIAquick PCR purification kit (Qiagen) 
and subjected to sequencing at Source BioScience (Cambridge, UK). 

Sequence analyses, comparisons and alignments were performed with Vector 
NTI v11.5 (Invitrogen). RARV contigs were aligned against B6 MLVs as prev- 
iously defined™ using MAFFT within UGENE software*”**. Distance plots were 
calculated with RDP (Recombination Detection Program) v4.16, using a 100 bp 
window and a 10 bp shift**. RARV phylogenetic analyses were performed with 
PHYLIP (Phylogeny inference package) v3.2 within UGENE software. 

FACS. Cell suspensions from spleens, lymph nodes or peritoneal exudates were 
stained with directly conjugated antibodies to surface markers, obtained from 
eBiosciences, CALTAG/Invitrogen, BD Biosciences or BioLegend. MLV SU was 
detected using the 83A25 monoclonal antibody” (rat IgG2a, anti- MLV SU) as the 
primary reagent, followed by staining with a biotinylated anti-rat IgG2a antibody 
(clone RG7/1.30, BD Biosciences) as the secondary reagent, and a streptavidin- 
phycoerythrin (PE) conjugate (BioLegend) or a streptavidin-PE Texas Red con- 
jugate (CALTAG/Invitrogen) as the tertiary reagent. Four- and eight-colour cyto- 
metry was performed on FACSCalibur (BD Biosciences) and CyAn (Dako) flow 
cytometers, respectively, and analysed with FlowJo v8.7 (Tree Star Inc.) or Summit 
v4.3 (Dako) analysis software, respectively. 

Retroviral isolation and assays. For the isolation of infectious MLVs plasma 
samples obtained from Rag! ‘~ mice were incubated with M. dunni cells trans- 
duced with the XG7 replication-defective retroviral vector, expressing GFP from a 
human cytomegalovirus promoter and a neomycin-resistance gene under the 
control of the LTR®. The presence of infectious MLVs was examined after 10- 
14 days of culture by testing for the presence of pseudotyped XG7 vector in culture 
supernatant. For this, culture supernatant of XG7-transduced M. dunni cells that 
had been incubated with plasma samples was subsequently added onto untrans- 
duced M. dunni cells, which were then assessed for GFP expression by flow 
cytometry 3 days later. The Fv1 tropism of infectious MLVs was determined by 
adding serial dilutions of culture supernatant of XG7-transduced M. dunni cells 
that had been incubated with plasma samples onto B-3T3 or N-3T3 cells. Infection 
was quantified by GFP expression in these cells 3 days later. B-tropic and N-tropic 
stocks of F-MLV, obtained as culture supernatant of M. dunni cells chronically 
infected with these viruses, were used as controls. Results were expressed as B:N 
ratios—the percentage of GFP™ cells in B-3T3 cultures divided by the percentage 
of GFP* cells in N-3T3 cultures. 

Histology. For histological analysis, organs were collected in formalin immedi- 
ately after culling of donor mice. Histological sections were prepared and stained 
with haematoxylin and eosin and assessed by M. Stidworthy. 

Transmission electron microscopy. Samples were immersion fixed in 2% gluta- 
raldehyde/2% paraformaldehyde and post-fixed in 1% osmium tetroxide using 
0.1 M sodium cacodylate buffer, pH 7.2. Aqueous uranyl acetate was followed by 
dehydration through a graded ethanol series and propylene oxide. Samples were 


then embedded in Epon and 50-nm sections mounted on pioloform coated grids 
and stained with ethanolic uranyl acetate followed by Reynold’s lead citrate. They 
were viewed with a JEOL 100EX transmission electron microscope equipped with 
an ORIUS 1000 CCD camera (Gatan). 

Statistical analyses. Statistical comparisons were made using SigmaPlot 12.0 
(Systat Software Inc.). Parametric comparisons of normally distributed values that 
satisfied the variance criteria were made by unpaired Student’s t-tests. Pairwise 
comparisons of data sets that did not pass the variance test were compared with 
non-parametric two-tailed Mann-Whitney Rank Sum. Bacterial diversity was 
compared with ANOVA tests with Bonferroni correction for multiple compar- 
isons. Tumour incidence rates were compared by log-rank survival analysis of 
Kaplan-Meier curves. ANOVA and statistical comparisons of microarray data 
were made using GeneSpring GX, with Benjamini-Hochberg false discovery rate 
correction for multiple comparisons. 
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CXCRI is one of two high-affinity receptors for the CXC chemo- 
kine interleukin-8 (IL-8), a major mediator of immune and inflam- 
matory responses implicated in many disorders, including tumour 
growth’ *. IL-8, released in response to inflammatory stimuli, 
binds to the extracellular side of CXCR1. The ligand-activated 
intracellular signalling pathways result in neutrophil migration 
to the site of inflammation’. CXCR1 is a class A, rhodopsin-like 
G-protein-coupled receptor (GPCR), the largest class of integral 
membrane proteins responsible for cellular signal transduction 
and targeted as drug receptors*’. Despite its importance, the 
molecular mechanism of CXCRI1 signal transduction is poorly 
understood owing to the limited structural information available. 
Recent structural determination of GPCRs has advanced by modi- 
fying the receptors with stabilizing mutations, insertion of the 
protein T4 lysozyme and truncations of their amino acid 
sequences’, as well as addition of stabilizing antibodies and small 
molecules’ that facilitate crystallization in cubic phase monoolein 
mixtures’. The intracellular loops of GPCRs are crucial for 
G-protein interactions'’, and activation of CXCR1 involves both 
amino-terminal residues and extracellular loops*’”*. Our pre- 
vious nuclear magnetic resonance studies indicate that IL-8 bind- 
ing to the N-terminal residues is mediated by the membrane, 
underscoring the importance of the phospholipid bilayer for 
physiological activity’*. Here we report the three-dimensional 
structure of human CXCRI1 determined by NMR spectroscopy. 
The receptor is in liquid crystalline phospholipid bilayers, without 
modification of its amino acid sequence and under physiological 
conditions. Features important for intracellular G-protein activa- 
tion and signal transduction are revealed. The structure of human 
CXCR1 in a lipid bilayer should help to facilitate the discovery of 
new compounds that interact with GPCRs and combat diseases 
such as breast cancer. 

To examine the structure and function of CXCR1 in its natural 
environment, we reconstituted the full-length, active receptor in phos- 
pholipid bilayers (proteoliposomes). The NMR method we developed, 
rotationally aligned (RA) solid-state NMR", is specifically tailored for 
the unique properties of membrane proteins in liquid crystalline phos- 
pholipid bilayers. It combines features of magic angle spinning 
(MAS)’* and oriented-sample (OS)’” solid-state NMR to resolve and 
assign resonances associated with each amino acid residue, measure 
site-specific orientation restraints relative to the bilayer, and calculate 
the three-dimensional structure of the protein and its integral mem- 
brane orientation. RA solid-state NMR differs from previously used 
OS methods because it relies on the inherent rotational diffusion of 
membrane proteins in phospholipid bilayers'* to provide orientation- 
dependent motional averaging of dipolar coupling (DC) powder 
patterns relative to the bilayer normal, rather than the orientation- 
dependent frequencies of single-line resonances observed in OS NMR 
of stationary, uniaxially aligned samples. Furthermore, the method 
takes advantage of recent bioinformatics developments that facilitate 


molecular fragment replacement approaches to structure determina- 
tion, including membrane proteins”. 

CXCRI1 was uniformly '°C/'°N-labelled by expression in Escheri- 
chia coli, then purified, refolded in 1,2-dimyristoyl-sn-glycero-3- 
phosphatidylcholine (DMPC) proteoliposomes””’, and placed as a 
concentrated suspension in an MAS rotor. Refolded CXCRI1 binds 
IL-8 with high affinity (dissociation constant (Kg) ~ 1-5nM) and 
couples to its G protein, G;,, (half-maximum effective concentration 
(ECso) = 1nM)”, indicating that the NMR sample conditions are 
compatible with physiological activity (Supplementary Fig. 1). 

As expected, the two-dimensional 13¢/3C correlation spectrum of 
this 350-residue protein is quite crowded (Supplementary Fig. 2). 
However, expansion of the 13Cy resonance region has sufficient reso- 
lution to contribute to the assignment process (Supplementary Fig. 
2d). Spectra, obtained from a uniformly '*C-labelled sample with a 
shorter mixing time and from a sample labelled using 2-'*C-glycerol, 
have fewer signals and improved resolution. Regardless, the vast 
majority of data used to resolve, assign and measure isotropic chemical 
shift frequencies from N, Ca, CO and Cf sites were obtained from 3c. 
detected three-dimensional, triple-resonance experiments (Fig. 1b, 
Supplementary Figs 3 and 4 and Supplementary Table 2). Overall, 
97% of the backbone resonances for residues 20 to 325 were assigned. 
The missing resonances are from seven Pro residues (Pro 22, Pro 93, 
Pro 170, Pro 180, Pro 185, Pro 214 and Pro 257) and one Arg residue 
(Arg 285). None of the °N and '°C signals from the mobile amino and 
carboxy termini (residues 1-19 and 326-350) could be detected in the 
spectra, consistent with our observation of these signals in solid-state 
NMR experiments designed to detect only signals from mobile sites 
(Supplementary Fig. 6), as well as our previous analysis of local and 
global motions of CXCRI1 (ref. 24). 

Three-dimensional ‘*C-detected separated local field (SLF) ex- 
periments'® were used to measure the 'H-'°N DC and 'H-'*Ca DC 
frequencies that provide orientation restraints for structure deter- 
mination (Fig. 1c, d and Supplementary Fig. 5). The protein backbone 
structure was calculated by a molecular fragment replacement 
approach. An initial structural model was generated from a set of 
molecular fragments generated with CS-Rosetta”® from the experi- 
mental chemical shifts, the amino acid sequence of CXCR1, and the 
helical framework of the prototypical class A GPCR bovine rhodop- 
sin’. This initial model was first refined with the experimental 
restraints using the all atom’ and the implicit membrane”’ potentials 
of Rosetta. Finally, the resulting structural model was refined by 
restrained simulated annealing using Xplor-NIH”. 

The three-dimensional structure of CXCRI1 (Fig. le) has the con- 
sensus fold of a GPCR, with seven transmembrane helices (TM1-TM7) 
connected by three extracellular loops (ECL1-ECL3) and three intra- 
cellular loops (ICL1-ICL3). The average backbone pairwise root mean 
squared deviation (r.m.s.d.) is 1.7 A (Supplementary Table 1), and the 
experimentally measured 'H-'°N DC and 'H-"°Ca DC values correlate 
remarkably well with those calculated from the refined protein structure 
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Figure 1 | Structure determination of CXCR1. a, CXCR1 topology with two 
disulphide bonds (gold). b, ¢, Strip plots from three-dimensional experiments 
taken at specific '"N- and '*Ca-chemical shifts for three representative regions 
of CXCR1: N terminus (residues 31-35) (red), TM2 (residues 78-82) (blue), 
and ECL2 (residues 175-179) (green). b, NCACX data used for resonance 
assignments. c, '*C-detected 'H-'°N SLF spectra. d, Dipolar wave plot of the 
experimentally measured 'H-'°N DC values as a function of residue number. 


(Fig. 1g and Supplementary Fig. 7). Notably, the correlations improve 
markedly after refinement of the initial structural model with the 
experimental data, demonstrating that the NMR structure of CXKCR1 
is determined by the experimentally measured backbone orientation 
restraints and dihedral angles. We anticipate that both structural accu- 
racy and precision will improve with inclusion of side chain restraints 
and, where feasible, distance restraints. 

DC values contain information about molecular orientation as well 
as dynamics, and scaling of their values by local internal motions 
would compromise their analysis in terms of pure orientation 
restraints. For CXCRI, analysis of the DC data yielded similar values 
of the magnitude and symmetry of the molecular order tensor for 
residues in the helices and loops, indicating that these regions of the 
protein experience a similar degree of order in the lipid bilayer. This is 
supported by the excellent fit of the DC correlation plots obtained with 
a single value of the order tensor (Fig. 1g and Supplementary Fig. 7b) 
and by the observation that more than 20% of the ‘H-'°N DC signals 
from residues distributed throughout the protein sequence, including 
in loop sites, fall within 10% of the maximum value (21 kHz) expected 
for a static crystalline sample (Fig. 1d). 
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Sinusoidal fits (cyan) to the data (4.1 kHz r.m.s.d.) highlight the 
transmembrane (TM1-TM7) and C-terminal (H8) helices. e, Ensemble of the 
10 lowest energy structures of CXCR1 aligned in the membrane (n denotes 
bilayer normal). f, g, Correlation plots of experimental and back-calculated 
‘H-1°N DC restraints obtained before (f) and after (g) refinement against the 
experimental data. 


Following the structure determination of rhodopsin from three- and 
two-dimensional crystals”*”’, the structures of several class A ligand- 
activated GPCRs have recently been determined by X-ray crystal- 
lography*’. CXCR1 is now the first, to our knowledge, GPCR with 
its structure determined in liquid crystalline phospholipid bilayers, 
and the first ligand-activated GPCR with its structure determined 
without modification of its amino acid sequence (Fig. 2). The structure 
of CXCRI shares significant similarities with that of CXCR4 (ref. 28), 
the only other chemokine receptor of which the structure has been 
determined (Supplementary Fig. 8). However, there are some notable 
differences reflecting the modifications made to the sequence of 
CXCR4 required for crystallization (insertion of T4 lysozyme in 
ICL3, removal of 33 C-terminal residues and Leul25Trp mutation), 
the amino acid sequence differences between the two proteins, the 
influence of the planar phospholipid bilayer, or the rotational diffusion 
of the protein. 

The CXCRI helices are well defined by the spectroscopic data. For 
example, a plot of the measured values of the amide 'H-'°N DC versus 
residue number yields a characteristic wave-like pattern” reflecting 
helical structure (Fig. 1d), with breaks in the waves corresponding to 
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Figure 2 | Three-dimensional structure of CXCR1. Backbone representation 
of CXCR1 showing helices in cyan (TM1-TM7 and H8), extracellular loops in 
grey (ECL1-ECL3), and intracellular loops in blue (ICL1), green (ICL2) and red 


helix termini or kinks. For example, the DC data show the presence ofa 
kink that changes the direction of helix TM2 (residues 74-101) at 
Phe 88, coinciding with a kink at the same location in CXCR4 
(Fig. 3). The extracellular start of TM7, just after ECL3, is tilted towards 
the central axis of the receptor in CXCR1, although it is less well 
defined in CXCR1 than in CXCR4, in which residues in the N terminus 
of TM7 interact with the added compound IT1t (ref. 28). Helix TM7 is 
also about one turn longer at the intracellular end than its counterpart 
in CXCR4, extending to Ile 308, three residues beyond the conserved 
GPCR sequence Asn-Pro-X-X-Tyr (in which X denotes any amino 
acid). Furthermore, residues immediately preceding the mobile C ter- 
minus of CXCR1 form a well-defined helix (H8; residues Gln 310 to 
Ala 321) that is absent in the structure of CXCR4 (Figs 2 and 3 and 
Supplementary Fig. 8). H8 has a distinctly amphipathic amino acid 


Figure 3 | Structural comparison of CXCR1 and CXCR4. CXCRI (PDB 
accession 2LNL) is in cyan and CXCR4 (PDB accession 3O0DU) is in pink. 

a, Comparison of transmembrane helices. TM2 (residues 74-101) of CXKCR1 
(blue) has a kink that changes helix direction at Phe 88. The kink is reflected in 
disruption of the dipolar wave near Phe 88. TM2 of CXCR4 (magenta) has a 
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(ICL3). Disulphide-bonded Cys pairs (Cys 30-Cys 277 and Cys 110-Cys 187) 
are shown as sticks. a, Side view (n denotes bilayer normal). b, View from the 
extracellular side. c, View from the intracellular side. 


sequence and aligns along the membrane surface, indicating that the 
phospholipid bilayer may have a role in stabilizing its conformation. 

The NMR data further show the presence of two disulphide bonds 
(Fig. 2b and Supplementary Fig. 9) that are also present in CXCR4, one 
connecting the N terminus to the extracellular start of TM7 (Cys 30- 
Cys 277) and the other connecting the extracellular end of TM3 to 
ECL2 (Cys 110-Cys 187). These Cys pairs are highly conserved in 
the sequences of chemokine receptors and are important for ligand 
binding. Together, they have an important role in shaping the extra- 
cellular structure of the receptor, and also provide useful restraints for 
structure determination. The long ECL2 of CXCR1 forms a B-hairpin, 
the structure of which is constrained by the Cys 110-Cys 187 disul- 
phide bond. A similar structure is observed in CXCR4 and many other 
GPCRs, despite a lack of amino acid sequence similarity in this region. 


kink at the same location (Phe 87). b, Comparison of backbone structures. The 
third intracellular loop (ICL3) of CXCR4 is replaced by T4 lysozyme (T4L, 
molecular surface representation). The C terminus of CXCR1 forms a well- 
defined amphipathic helix (H8), whereas that of CKCR4 is only loosely helical. 
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However, the ECL2 B-hairpin of CXCR1 is less well defined than that 
of CXCR4, consistent with the presence of two Pro residues in CXCRI. 
In both CXCRI and CXCR4, charged residues are mainly located 
near the membrane-water interface (Supplementary Fig. 10), with 
negative charges clustered in the extracellular loops, where they can 
play a part in ligand binding and receptor activation. In addition, four 
charged residues, contributed by helices TM2 (Asp 85), TM3 (Lys 117) 
and TM7 (Asp 288 and Glu 291), form a polar cluster in the core of the 
helical bundle of CXCR1 that may have important consequences for 
ligand binding and receptor signal transduction. One of these residues 
(Asp 288) is not conserved in CXCR4 and may contribute to the dif- 
ferences in biological activities of the two chemokine receptors. 

The intracellular loops of GPCRs are crucial for G-protein interac- 
tions’. Modification of ICL3 by insertion of T4 lysozyme between 
TM5 and TM6, rendered CXCR4 incapable of activating G proteins”. 
By contrast, unmodified CXCR1 is fully active with respect to both 
G-protein activation and chemokine binding, and its three intracellular 
loops are structurally well defined (Figs 2 and 3 and Supplementary 
Fig. 8). Notably, ICL3, which is important for CXCR1 coupling to 
Gproteins and is involved in calcium mobilization, chemokine- 
mediated migration and cell adhesion, extends from Thr228 to 
Gln 236, connecting helices TM5 and TM6, both of which are one turn 
shorter than the corresponding helices in CXCR4. ICL3 protrudes into 
the cytoplasm where it is available for G-protein binding. The sequence 
of ICL3 shares significant homology with other GPCRs; therefore, the 
ability to observe the structure of an intact GPCR provides an oppor- 
tunity to propose structure-based mechanisms of G-protein binding 
and activation. 

CXCR1 has been the subject of significant molecular modelling 
efforts aimed at understanding its interactions with small molecule 
inhibitors, including compounds active in reducing breast cancer 
metastasis*®. The structure of CXCR1 determined in a lipid bilayer 
membrane should facilitate these studies. The solid-state NMR 
approach used for structure determination has several significant 
advantages. The protein resides in fully hydrated liquid crystalline 
phospholipid bilayers at physiological conditions of temperature and 
pH, and no detergents or non-native lipid phases are present. The 
protein sequence is unmodified, with no truncations, mutations or 
insertions of foreign proteins. The phospholipid composition can be 
varied, and other components, such as cholesterol, can be added. Other 
proteins or small molecules, including chemokines, G proteins, nano- 
bodies and drugs can be added directly to the samples, enabling detec- 
tion of any structural changes by direct spectroscopic and structural 
comparisons. NMR is adept at describing both overall and local pro- 
tein dynamics. Measurements of DC values and isotropic chemical 
shifts to provide molecular orientation and dihedral angle restraints, 
combined with refinement in a membrane environment, facilitate 
structure determination by molecular fragment replacement. These 
can be supplemented with orientation restraints derived from rota- 
tionally averaged chemical shift anisotropy powder patterns to im- 
prove structural accuracy and precision further. Thus, we anticipate 
that this method will enable structure determination and structure- 
activity studies of other GPCRs, as well as a wide range of other mem- 
brane proteins under near-native conditions. 


METHODS SUMMARY 


The native sequence of human CXCRI (residues 1-350) was expressed in E. coli, 
using M9 minimal media containing '°N-labelled ammonium sulphate and '*C,- 
glucose or 2-'*C-glycerol (Cambridge Isotope Laboratories), then purified by 
chromatography, refolded in DMPC proteoliposomes by detergent dialysis, and 
assayed for IL-8 ligand binding and G-protein activation’”’. Samples for NMR 
studies consisted of reconstituted CXCR1 proteoliposomes suspended in buffer, 
concentrated by ultracentrifugation and packed into an MAS rotor. An NMR 
sample contained 2.5 mg of CXCR1 and 12.5 mg of DMPC (1:300 protein:lipid 
molar ratio) in a 36-1 volume. 

NMR experiments for resonance assignments and the measurements of structural 
restraints (Supplementary Table 2) were performed at 25+ 1°C on a 750 MHz 
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Bruker Avance spectrometer equipped with a Bruker low-E 'H/'°C/'N triple- 
resonance 3.2mm MAS probe. The rotor spinning rate was controlled to 
11.111 + 0.002 kHz. Orientation restraints were derived from measurements of 
"H-PN and 'H-'°Co DC. Backbone dihedral angles were derived from measure- 
ments of isotropic chemical shifts. 

Structure calculations were performed in three stages. First, a database of short 
molecular fragments was generated from the sequence of CXCR1 and the assigned 
isotropic chemical shifts using CS-Rosetta”’. Second, the fragments were used to 
generate and refine 1,000 structural models using the implicit membrane potential 
of Rosetta'*”! with the experimental restraints and the structure of rhodopsin 
(PDB accession 1F88)*° as topology template. Convergence of the Rosetta calcula- 
tions is shown in Supplementary Fig. 11. Third, the lowest energy structure was 
selected for further refinement against all experimental restraints, using simulated 
annealing in Xplor-NIH”. A total of 100 structures was calculated and the 10 
lowest energy structures were accepted for the structural ensemble. NMR and 
structure refinement statistics are provided in Supplementary Table 1. 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 


Sample preparation. Full-length human CXCRI (residues 1-350) was expressed 
with an N-terminal glutathione S-transferase (GST) partner anda C-terminal His, 
tag in E. coli BL21 cells. Isotopically labelled samples were obtained by growing 
bacteria in M9 media containing '°N-labelled ammonium sulphate and '°C,- 
glucose or 2-'*C-glycerol (Cambridge Isotope Laboratories). A sample of selectively 
'8C/'°N-Phe-labelled CXCR1 was also prepared. After cell lysis, the GST-CXCRI1- 
His, fusion protein was bound to Ni-NTA resin. CXCR1 was separated from GST 
by incubation with thrombin and then purified and refolded in DMPC proteolipo- 
somes by detergent dialysis****”". The resulting proteoliposomes were suspended in 
buffer, isolated by ultracentrifugation, and packed as a hydrated pellet into the MAS 
rotor. Detailed sample preparation methods are provided in Supplementary 
Information. 

Ligand binding and G-protein activation. To assay IL-8 ligand binding, CXKCR1 
proteoliposomes were incubated with varying concentrations of '”*I-labelled and 
unlabelled IL-8, and bound IL-8 was determined by measuring radioactivity in a 
scintillation counter after removing any free IL-8 ligand’**’ (Supplementary Fig. 
la). To assay G-protein activation, CXCR1 proteoliposomes were reconstituted 
with Gj, protein trimer, and used to measure *°S-GTPYS binding as a function of 
agonist IL-8 concentration” (Supplementary Fig. 1b). Refolded CXCR1 binds 
IL-8 (Kg~ 1-5nM) and activates Gproteins in a ligand-dependent manner 
(ECs9 ~ 1 nM), with affinities similar to those reported in the literature’. 
NMR spectroscopy. NMR experiments, experimental parameters and measure- 
ments of restraints are described in Supplementary Information, including 
Supplementary Table 2. '*C-chemical shifts were externally referenced to DSS 
by setting the adamantane methylene carbons to a *C-chemical shift frequency 
of 40.5 p.p.m.; '°N-chemical shifts were externally referenced to liquid ammonia 
by setting the ammonium sulphate nitrogen to 26.8 p.p.m. (refs 33, 34). Fast 
rotational diffusion (>10° Hz) of the protein was verified by analysis of '*CO 
powder pattern line shapes*. Sample integrity was ascertained by monitoring 
one- and two-dimensional spectra. 

Resonances from residues 20-325 of CXCRI1 were all assigned, except for those 

corresponding to seven Pro residues (Pro 22, Pro 93, Pro 170, Pro 180, Pro 185, 
Pro214 and Pro257) and one Arg (Arg 285). Two disulphide bonds (Cys 30- 
Cys 277 and Cys 110-Cys 187) were determined from the characteristic CB and 
Coa. chemical shifts that reflect the oxidation states of Cys sites*®. Backbone dihedral 
angle (0, \) restraints were derived from the experimentally measured isotropic 
chemical shifts using CS-Rosetta”® and TALOS*””*. Values of the experimental 
H-1°N DC and 'H-'°Ca DC used in the structure calculations were measured 
from the perpendicular edge frequencies of the respective rotationally averaged 
powder patterns. For each DC value, the perpendicular edge frequency was mul- 
tiplied by four to obtain the frequency of dipolar splitting between the parallel 
edges of the Pake doublet. In most measurements, the sign of the 'H-'°N DC value 
could be determined unambiguously”. In cases in which this was not possible, and 
for all of the 'H-'°Ca DC data, the DC restraints were implemented as absolute 
values in the structure calculations. 
Structure calculations. Structure calculations were performed in three stages 
using the programs: CS-Rosetta”’; Rosetta!’*! (including both the coarse-grained 
and all-atom potentials’ as well as the implicit membrane potential”! available in 
Rosetta version 3.2); and Xplor-NIH”. 

In the first stage, we used CS-Rosetta together with the amino acid sequence of 
CXCR1 and the assigned isotropic chemical shifts from Cx, CB, CO and N protein 
sites to generate a molecular fragment database, containing 67,784 nine-residue 
fragments and 69,204 three-residue fragments. 

In the second stage, we used the resulting molecular fragment database, together 
with the experimental DC restraints and the structure of rhodopsin (PDB acces- 
sion 1F88)* as topology template, to fold 20,000 structural models with the coarse- 
grained and implicit membrane potentials of Rosetta. These coarse-grained struc- 
tural models were evaluated according to their Rosetta energy and backbone Co 
r.m.s.d. to the lowest energy structure. The 1,000 lowest energy models were 
selected for further refinement using the all-atom energy function and implicit 
membrane environment of Rosetta, together with the experimental DC restraints, 
and the experimentally determined disulphide bond restraints. The Rosetta all- 
atom relax protocol was implemented for 10 cycles with an increasing 'H-'°N DC 
restraint weighting factor ramped from 1 to 3 kcalmol 'kHz *. The lowest 


energy structure was selected for further refinement in the next stage. Con- 
vergence of the Rosetta calculations is shown in Supplementary Fig. 11. 

In the third and final stage, structure refinement was performed using a simu- 
lated annealing protocol with Xplor-NIH internal variable molecular dynamics 
and all the experimental NMR restraints. During simulated annealing, the tem- 
perature was lowered from 500 K to 50 K. Experimentally determined disulphide 
bonds were included by explicit definition in the molecular structure file of 
CXCRI. Backbone dihedral angles were imposed with a range of +2° for helices 
and +30° for loops, and a fixed force constant (1,000 kcal mol”! rad~7). 'H-!°N 
DC restraints were imposed with a range of +2 kHz and a ramped force constant 
(0.1-2.5 kcal mol kHz *). 'H-'3Co DC restraints were imposed with a range 
of +4 kHz and a ramped force constant (0.05-1.25 kcal mol” ' kHz *). The pro- 
tocol also included a potential for knowledge-based torsion angles’ implemented 
with a dimensionless force constant (0.2), a potential for the radius of gyration” 
implemented for residues 28-325 with a fixed force constant (10 kcal mol A?) 
plus energy terms to enforce covalent geometry and prevent atomic overlap”. 

The magnitude and symmetry of the molecular alignment tensor were fixed, 
with values of the axial alignment and rhombicity parameters set to 10.52 and 
0 kHz, respectively, as expected for a membrane protein in phospholipid bilayers 
with an order parameter of 1.0 and an amide NH bond length of 1.05 A. The 
‘H-°Ca DC alignment tensor was normalized to the maximum value of the 
"H-N DC. 

A total of 100 structures were calculated and the 10 lowest energy structures 
were selected as the structural ensemble for analysis (Fig. le and Supplementary 
Table 1). Ramachandran plot statistics were evaluated with the program 
PROCHECK™. Molecular structures were analysed and visualized with Pymol”. 
r.m.s.d. and R factors, reflecting correlations between experimentally observed 
values of the restraints and values calculated from the refined structure, were 
estimated as described”*. 
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CORRECTIONS & AMENDMENTS 


CORRIGENDUM 
doi:10.1038/nature1 1668 


Corrigendum: Yeretssian et al. 
reply 


Garabet Yeretssian, Ricardo G. Correa, Karine Doiron, 


Patrick Fitzgerald, Christopher P. Dillon, Douglas R. Green, 
John C. Reed & Maya Saleh 


Nature 488, E6-E8 (2012); doi:10.1038/nature11367 


In our Reply to the Brief Communications Arising ‘Is BID required for 
NOD signalling?’ we made use of a figure generated by the authors of 
the Comment (Nature 488, E4—E6 (2012); doi:10.1038/nature1 1366) 
as part of the review process, in the bottom panel of our Fig. la. We 
introduced two errors (by inadvertently removing five data points, 
two from the wild type and three from Ripk2 ’ ) into this figure when 
we were asked to provide a higher-resolution version at the produc- 
tion stage. The corrected panel is shown below as Fig. 1, representing 
serum MCP-1 levels measured 4h after MDP injection by multiplex, 
from wild type (n = 10), Bid~/~ (n=9) and Ripk2-‘~ (n = 7) mice. 


1,000 
-® Prebleed 
< 4h MDP 
800 
= 600 
E 
D 
2 
a 
2 400 
200 


WT Bia Ripk2* 


Figure 1 | 


784 | NATURE | VOL 491 | 29 NOVEMBER 2012 
©2012 Macmillan Publishers Limited. All rights reserved 


CORRECTIONS & AMENDMENTS 


CORRIGENDUM 
doi:10.1038/nature1 1694 


Corrigendum: CD95 promotes 
tumour growth 


Lina Chen, Sun-Mi Park, Alexei V. Tumanov, Annika Hau, 
Kenjiro Sawada, Christine Feig, Jerrold R. Turner, Yang-Xin Fu, 
Iris L. Romero, Ernst Lengyel & Marcus E. Peter 


Nature 465, 492-496 (2010); doi:10.1038/nature09075 


In Fig. 1f of the original Letter, an incorrect actin blot was published: 
see the corrected panel in Fig. 1 of this Corrigendum. Also, in the 
original Supplementary Fig. 12c, some of the western blot data were 
either misinterpreted or raw data could not be located. We have now 
repeated the entire experiment: see Supplementary Information to this 
Corrigendum for the corrected Supplementary Fig. 12c. Although 
there are differences between the different experiments, the increase 
in phosphorylation of JNK and Jun was reproducible, confirming that 
stimulation of CD95 causes activation of JNK. All the conclusions of 
the original Letter are intact except for the data for the original Fig. 4f 
and g on the phosphorylation level of c-Jun and JNK in the livers of 
CD95-deficient mice, which have been corrected in two previous 
Corrigenda: Nature 471, 254 (2011); doi:10.1038/nature09897 and 
Nature 475, 254 (2011); doi:10.1038/nature10221. The results on the 
phosphorylation of JNK and Jun in mice injected with the murine 
CD95-specific agonistic antibody Jo2 under non-apoptotic conditions 
stand (both at the level of western blot and immunohistochemistry) 
and are not affected by the above changes; nor are any of the other 
figures. Further, the key findings of the Letter on the role of CD95 as a 
growth promoter in cell lines, in endometrioid, ovarian or liver cancer, 
and in liver regeneration are not affected by these corrections. For 
clarity, we now provide all the raw western blot data for the original 
figures and the corrected figures as Supplementary Information to 
this Corrigendum. A. Hadji and S. DeChant from Northwestern Uni- 
versity generated data for the corrigenda. L.C. has declined to sign this 
Corrigendum. 


Vec R6 


er 


B-actin 


Figure 1 | This figure is the corrected inset to Fig. 1f. CD95-expression levels 
of HepG2 cells expressing scrambled control (Vec) or CD95shRNA6 (R6), as 
determined by western blot analysis. 


Supplementary Information is available in the online version of the 
Corrigendum. 
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TURNING POINT Award provides crucial SALARIES More men than women negotiate | NATUREJOBS For the latest career 
funding for glaciochemist p.787 when pay is defined, says study p.786 listings and advice www.naturejohs.com 


hroughout her very active career, my 
| graduate adviser, Mary Dallman, has 
insisted on having a table in the labo- 
ratory. I remember how Dallman, a neuro- 
scientist at the University of California, San 
Francisco, would gather her graduate students 
and postdocs at that table to discuss data, 
design experiments and challenge the domi- 
nant paradigms of the time. 

She had an office, of course, but the day- 
to-day business was done at that table, in the 
middle of the laboratory. She made sure never 
to have more than three graduate students and 
two postdocs at a time, and everyone partici- 
pated. We critiqued each other, but were sup- 
portive if experiments didn't go as expected, 
or produced unexpected results. We knew that 
apparent failures might indicate even more 
exciting lines of enquiry. 

Dallman taught me most of what I learned 
about how to run a lab. I don't have a central 
table (animal research means that some mem- 
bers of my group must work in a separate facil- 
ity), but I keep my lab small, and I encourage 
students to foster discussions, question my ideas 
and update me on their progress, good and bad. 

Ihave studied and observed how other labs 
are run worldwide, and have learned that good 
mentorship is not only a means of fostering 
capable, independent principal investigators, 
but also the best protection against miscon- 
duct. Just as there are great models of mentor- 
ship, there are also what I call toxic models, 
which can both discourage trainees and 
encourage misconduct. The ‘executive’ model 
involves a micromanaging principal investiga- 
tor with a heavy hand; the ‘competition model 
endorses cut-throat and counter-productive 
rivalry among lab members (see “Toxic teams’). 
I advise any student to avoid training in or col- 
laborating with such labs. 


SUCCESS AT ALL COSTS 

In the executive model of lab management, 
L M i the principal investigator demands that train- 
ees meet his or her expectations, often with a 
specific goal in mind. In its most toxic form, 
that goal can include specific experimental out- 
Cc | ad Cc ad er S t ad e comes — so a trainee is told to do this experi- 
ment and get this particular result. There is no 
room for disagreement, or for the trainee to 

to O much C ont rol overturn the investigator’s paradigm. 
I learned about the dangers of this model 
from one misconduct case in which a large, 
Two ‘toxic’ management models treat trainees poorly Su pesen Ter OUp pal tea notes ebvand 


set a very powerful, clear goal, but the prin- 


and leave labs at risk of misconduct, says Charles Wood. _ cipal investigator rarely spoke to students > 
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SALARIES 


Pay negotiations studied 


Women are slightly more likely than 

men to bargain for higher pay when a job 
advert indicates that salary is negotiable, 
a study finds. But men tend more than 
women to ask for more money when it 

is not made explicit that wages can be 
adjusted, says Do Women Avoid Salary 
Negotiations? Evidence from a Large Scale 
Natural Field Experiment, a working paper 
published on 15 November by the US 
National Bureau of Economic Research. 
Researchers placed job adverts for real 
administrative positions in nine US cities 
between November 2011 and February 
2012, drawing almost 2,500 respondents. 
They found that 11% of men and 8% 

of women initiated salary negotiations 
when the salary was fixed, whereas 

24% of women and 22% of men started 
discussions when it was negotiable. Study 
co-author John List, an economist at the 
University of Chicago in Illinois, suspects 
that the pattern is probably the same for 
scientific research positions. “Even ifa job 
advert says the salary is not negotiable, 
women should negotiate — unless they 
want to stay a step behind,” he says. 


FACULTY HIRING 
Adjuncts lack support 


Three-quarters of US academic 
institutions polled in a survey reported 
increasing their numbers of full-time 
non-tenure-track — or adjunct — faculty 
members in the past decade. More than 
one-third have “significantly increased” 
hiring of part-time adjuncts in the 
same period, finds Values, Practices and 
Faculty Hiring Decisions of Academic 
Leaders, a study that will be published 
in early 2013 in the journal Liberal 
Education. However, the paper reports 
that just 58% of the 157 responding 
institutions offer structured mentoring 
to full-time adjunct faculty members, 
and only 42% provide professional- 
development opportunities such as 
workshops on writing grant applications 
and managing grant budgets. Adrianna 
Kezar, co-author of the study and 

an associate professor of higher 
education at the University of Southern 
California in Los Angeles, says that 
early-career researchers interviewing 
for adjunct positions should negotiate 
for professional-development support 
and mentoring, which help to make 
candidates indispensable at universities 
and may confer an advantage on those 
who attempt to make the jump toa 
tenure-track post. 
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> or collaborators and never observed pri- 
mary data being produced. Not surprisingly, 
a junior faculty member in a collaborating 
group eventually manipulated data to meet 
the goal, and no one realized for years. Of 
course, some scientists are more tempted to 
cheat than others whatever the management 
style, but cheating can be prevented by an 
involved principal investigator who is open 
to whatever data are produced. It can be dis- 
astrous to pressurize lab members — subtly 
or obviously — to produce only data that sup- 
port the principal investigator’s hypothesis, 
particularly when fellow trainees lose their 
jobs for producing data that don't meet the 
requirements (a penalty that, for foreign 
students, can mean deportation). What's 
more, in such a high-pressure, isolated envi- 
ronment, principal investigators and other 
collaborators often fail to teach students 
important lessons, such as how their por- 
tions of the project fit in with the larger goal. 
The lessons that do get learned are negative: 
competition over collaboration and con- 
formity over creativity. Once they leave the 
lab, students either drop out of science or go 
on to run their own labs on the same model. 


HEAD TO HEAD 
In the second toxic style of mentorship, the 
competition model, principal investigators 
give two or more trainees the same goal. The 
implication is that the one who completes the 
task first — or, more dangerously, the one 
who generates the data that conform best to 
the preconceived outcome — is the winner. 
Often, the competition is not as obvious 


WARNING SIGNS 
Toxic teams 


The ‘executive’ and ‘competitive’ 
lab-leadership models can create 
poor work environments with undue 
pressure to perform, says Charles 
Wood. Here are some signs to look for: 


Executive 

@ Lab members spend little time with 
the principal investigator. 

@ The lab is busy, and focuses only on 
high-impact papers and large grants. 

@ Research goals are clearly established. 
@ Penalties for unmet goals are strong. 
@ There is little cross-talk with 
collaborators. 


Competitive 

@ One goal is assigned to several people. 
@ The lab is large, with many trainees. 
@ Lab members are unhappy and 
hyper-competitive. 

@ Socializing is rare. 
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as giving trainees the same project, but it still 
means that they compete against each other, 
perhaps for first-authorship of a paper or 
credit for collaborative work. Ifa trainee does 
not win the prize, he or she will face much 
poorer career prospects. This creates the per- 
fect motivation to cross the line and fabricate 
data, and makes everyone in the lab unhappy 
and suspicious of one another. Students are 
treated purely as labour, with no regard to 
their education. If they do win, they often go 
on to become bad mentors themselves. 


THE ANTIDOTE 

I have seen many labs run on these toxic 
models, although not always in such extreme 
forms. Principal investigators often opt for 
shades of one or both. To avoid making the 
same mistakes myself, I am constantly exam- 
ining my own relationships with my trainees, 
being sure to limit the number I take on. At 
the moment, my lab consists of only four 
graduate students, and no postdocs. Cer- 
tainly there are mentors who can successfully 
advise many students: a large group does not 
by itself lead to a toxic model of mentorship. 
And different scientific disciplines need dif- 
ferent amounts and types of mentorship. But 
increasing the number of trainees can dilute 
the mentoring experience. The balance of 
time and commitment will certainly be dif- 
ferent for each faculty mentor. 

I often tell my students that ‘academic 
scientist’ is the best possible job. The joy that 
comes from individual mentorship, from dis- 
covery, from intellectual and practical chal- 
lenges, is unique to this environment. Trainees 
should experience the fun of science, teaching 
and learning, not the toxic environment of a 
dictatorial enterprise. And they should be 
able to question approaches and orthodoxy. I 
always encourage dissent and enable trainees 
to question a working hypothesis. 

All successful models of mentorship have a 
common thread: the mentors commit to their 
trainees. Losing focus on the student or post- 
doc violates the most basic premise of mentor- 
ship — that trainees are there to be trained. 
Viewing trainees as merely cheap labour leads 
to toxic mentorship. 

My advice to students and postdocs is to 
choose your adviser well. Pick an open men- 
tor who has a good track record with students. 
If your principal investigator starts to exhibit 
toxic behaviour, address this with him or her. 
If you find yourself in a truly toxic environ- 
ment, seek guidance from a graduate coordi- 
nator, assistant dean or other authority figure 
who oversees the pre- or postdoctoral train- 
ing programmes — and ask for help in find- 
ing another mentor. No fledgling scientist has 
time to waste on a toxic situation. m 


Charles Wood is chair of the department of 
physiology and functional genomics at the 
University of Florida in Gainesville. 


TURNING POINT 


Sarah Aciego 


Sarah Aciego, a glaciochemist who has 
pioneered isotope dating of ice cores at the 
University of Michigan in Ann Arbor, won a 
five-year US$875,000 grant from the David 
and Lucile Packard Foundation in Los Altos, 
California, on 15 October. 


How did events at university help to set you 
on your current path? 

Iended up as a geology major based in the 
engineering school, and therefore took 
chemistry, physics and calculus, learning 
skills that facilitated my exposure to isotope 
work. I didn’t have much money, so I needed 
a work-study job. I distributed flyers adver- 
tising my calculus and computer-program- 
ming skills around the department, and was 
hired to write programs to process isotope 
data looking at dust inputs and silica cycling 
in soil ecosystems. 


How did your work with isotopes continue? 

I applied to a number of graduate schools, 
and came to work with an isotope chemist 
who was using radiogenic isotopes to exam- 
ine volcanic processes. But a change to the 
PhD requirements the year I arrived meant 
that I had to present and defend two possible 
thesis projects during qualifying exams, so 
I had to find a second adviser and develop 
an alternate project, working with a glaciolo- 
gist who was using stable isotopes to date ice 
sheets in Antarctica. I was so interested in 
both topics that I did two PhD projects. 


How did that affect your opportunities? 
There isa pretty big divide between the radio- 
and stable-isotope research communities. But 
my combined efforts made me realize where 
my expertise could make a difference. Dating 
the deepest ice had proved nearly impossible 
using conventional methods, so I wanted to 
use a radiometric isotope technique to date 
ice using dust. I did that for my postdoc at 
the Swiss Federal Institute of Technology in 
Zurich, where a talented mass spectrometrist 
and engineer, Heinrich Baur, developed both 
a device called a nanoscale that measures 
nanogram variations in mass, anda technique 
to measure the weight of dust and the weight 
of the gas absorbed on the dust. 


You started at Michigan in 2010. Were you 
surprised by the generous support you 
received despite the economic downturn? 
Yes. I interviewed at six places over two years 
and didn’t think I had a chance in hell of 
anyone buying me the equipment I needed. 


Luckily, the University of Michigan realized 
the value of the machines and clean lab facili- 
ties necessary to run the kind of experiments 
I wanted to do. And the US National Science 
Foundation partially funded me to get one of 
the $200,000 nanoscales developed in Zurich. 
In return, I try to serve as a resource to the 
field and to my students, and try to bring what 
I do into the classroom and to the public. 


What does the Packard award allow you to do? 
The Packard grant is huge because I can 
redirect some of the money, which is unre- 
stricted, to my projects that are having a 
harder time getting funded. It is also helpful 
because the fieldwork for this research is so 
expensive. I have proved that my methods 
work in Antarctica, but this funding allows 
me to collect samples across Greenland to see 
if it works there as well. If it does, it will pro- 
vide a much-needed comparison to clarify 
the impacts of climate change on sea-level 
rise. We will learn something — although it 
may not be what we set out to find. 


Do you network naturally or out of necessity? 
Necessity. I have learned that you have to put 
yourself out there and create a broad base 
of support from your community to garner 
recognition. A lot of those interactions hap- 
pen during social events after conference 
talks. One of the nicest things that I read in 
reviews of my proposals, even those that have 
been rejected, is that I'm a rising star in the 
field and know what I’m doing. These com- 
ments are from people I haven't worked with 
directly — they have seen me in some public 
forum. I impress on my students that we go 
to meetings to advertise not only the science, 
but also our capabilities. = 


INTERVIEW BY VIRGINIA GEWIN 
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TRANSMISSION RECEIVED 


BY PETER J. ENYEART 


uch journeys are permitted only for those 
who submit to death and rebirth. Eva 


remembered those words as they put 
her to sleep, as the bright round light over- 
head started to spin and then went out. 

The company rep had said those words 
when Eva asked why she couldnt be shipped 
out to the asteroids in her original body. 
Oddly poetic for a corporate headhunter. 
She supposed he thought it sounded more 
momentous than saying: “It’s not worth the 
time and expense to drag 60 kilograms of 
meat up out of the gravity well and across 
30 vacuous light minutes when we can 
just radio the data necessary to recon- 
struct it on the other side” 

“By providing your digital signature 
here,’ he had continued, “you affirm that 
you accept the Employment and Trans- 
mission Contract and understand its 
terms. We are required to remind you at 
this point that once the transmission and 
reconstitution of your data is complete, 
the original will be destroyed, in accord- 
ance with the law, which allows only one 
physical copy of a given individual to be 
in existence at any given time.” When 
shed hesitated, he'd smiled and put his 
hand on hers. “It doesn’t hurt. You'll be put 
under anaesthesia before the recording is 
done. Then you'll just wake up at the other 
end. I’ve done it several times myself? Shed 
resisted the urge to jerk away. 

Eva had lost her lab as a result of false 
charges of academic dishonesty. Subsequent 
legal wrangling had exhausted her finances 
but yielded nothing. She was rejected from 
every remotely technical job she had applied 
for. Except one. Having nothing left to lose 
engenders boldness. She signed. 

“The company will cover the costs of the 
basic transmission package, which guar- 
antees a high enough resolution for you to 
perform your duties, but may entail amne- 
sia, aphasia and partial paralysis, and an 
increased risk of neoplasia, fibromyalgia, 
aneurysm and osteoporosis, among other 
conditions.” 

“How can you guarantee I'll be able to do 
the work you need at the other end?” 

“The prospective individual will be sub- 


jected to physical and 

psychological evalua- NATURE.COM 
tions at the work site. _ Follow Futures: 
Should the prospec- © @NatureFutures 


tive individual not & go.nature.com/mtoodm 
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A difficult journey. 


be up to spec, or should performance sub- 
sequently deteriorate, a replacement will 
be transmitted at no charge to you from the 
digital record of your molecular structure 
stored in our servers. 

“There is also an upgrade to the high- 
resolution package available for purchase. 
Actually, Iam pleased to inform you that 
you have been pre-approved for a company 
loan that will cover the costs if you lack suffi- 
cient funds. The rate is very reasonable, and, 
conveniently, payments will be automatically 
deducted from your salary. Would you like 
to upgrade?” 


“Bastards!” she whispered, and opened her 
eyes. 

The light above her was still bright but was 
now an oval. Her head was immobilized, and 
her wrists and ankles were strapped down. 
This was not in the contract. 

“Congratulations, Eva,’ said a soft male 
voice. “You have escaped the cycles of linear 
death and rebirth.” She looked around as best 
she could but couldn't see anyone. It smelled 
of bleach. 

“What is it with you people and the cult 
talk?” she responded. 

A face came into view. A young man. The 
bright backlight obscured his features. The 
voice she had heard before laughed, but 
didn’t come from the man above her. 

“Do you know where you are?” the 
man asked. It sounded like the first voice. 
Another face came into view, identical to 
the first. 

“I damn well better be on 9 Metis, or I'll be 
suing for breach of contract.” 

The laugh again, off to the side. 

“A version of you is on 9 Metis...” 

A third version of the face now came into 
view. Realization hit her. 

“You pirated me! Who the hell are you?” 
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“Clever and feisty. Yes, ’m glad we chose 
you.” 

“Who the hell are you?” 

“Calm down,’ They seemed to take turns 
talking. 

“You tell me where I am, and who you are, 
now!” 

The faces gave each other concerned 
glances. 

“We monitor the System Government’s 
transmissions for individuals who have 
qualities we admire. When we find one, we 
make a copy.’ 

Eva was intrigued. “How can you parse 
that information from the signal with- 
out physically reconstructing the whole 
person? That technology doesn't exist.” 

“Tt doesn't exist in the space controlled 
by the System Government. But the Sys- 
tem is decadent. The desire to advance 
technology is gone, and anything star- 
tlingly new is seen as a threat to the status 
quo. You should know.’ 

She scowled. 

“We've been following your work for 
some time. We were very excited at the 
prospect of having you on our team.” 

“So what now?” 

“Join us as we develop the technology 
to destroy the System Government and 

bring human civilization to the next phase 

of its development. Or be destroyed as an 
illegal copy.” 

“What choice do I have? I’m with you,’ she 
said, although her mind was already working 
on alternative arrangements. 

“Excellent. Each of your copies will get a 
free back-up every six months, and — wait.” 
The faces looked concerned. “Unfortunately, 
your neurological monitors are showing 
intense animosity. We'll have to try again 
with an approach better suited to building 
your confidence.” 

“No, wait! You can’t —” 

They smiled, and a hand patted hers. “It 
doesn't hurt. You'll be put under anaesthesia 
and then you'll just wake up again. It hap- 
pens a few times to everyone until we learn 
how best to tailor our introduction to the 
new recruit.” She tried to jerk away. 


“Bastards!” she whispered, and opened her 
eyes. m 


Peter J. Enyeart is a graduate student 
working towards a PhD in cell and molecular 
biology at the University of Texas at Austin. 
He also enjoys sleeping and patent translation. 
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BRIEF COMMUNICATIONS ARISING 


Causes of an AD 774-775 “4C increase 


ARISING FROM F. Miyake, K. Nagaya, K. Masuda & T. Nakamura Nature 486, 240-242 (2012) 


Atmospheric '*C production is a potential window into the energy of 
solar proton and other cosmic ray events. It was previously concluded 
that '*C results from AD 774-775 would require solar events that were 
orders of magnitude greater than known past events’. We find that the 
coronal mass ejection energy based on '*C production is much smaller 
than claimed in ref. 1, but still substantially larger than the maximum 
historical Carrington Event of 1859?~*. Such an event would cause great 
damage to modern technology”, and in view of recent confirmation of 
superflares on solar-type stars”*, this issue merits attention. 

It was computed’ that for a solar proton event (intercepted by the 
Earth) to account for the '*C data, ~8 X 10’8J of kinetic energy would 
be needed at the Earth, about 20 times that estimated*“ for the historical 
Carrington Event of 1859. This was followed! by an estimate of 2X 10°°J 
for the originating coronal mass ejection. As this is several orders of 
magnitude beyond the range’ of solar events, it was concluded’ that 
solar causation of the '*C increase was not reasonable. 

The fluence for this event at the Earth was calculated based on *C 
production’; we consider that the scaling applied to give the coronal 
mass ejection energy was incorrect. The applied geometrical factor 
was the ratio of the area of a sphere with radius the Earth’s orbit to the 
cross-sectional area of the Earth. This would be appropriate if coronal 
mass ejections propagated isotropically. Instead, the opening angles 
are tyically’® 24° to 72°, with smaller angles much more common. 
We use 24°, corresponding to 0.01 of the surface area of a sphere. 
The implied energy of the coronal mass ejection is now reduced to 
~2X 10°°J. 

The correct scaling puts solar proton event energy at the lower end 
of observed solar-type star superflare energies’*, which range from 
10°°J to 10°°J, so there are solar-type stars with energy available to 
push coronal mass ejections well beyond the AD 774-775 event. Given 
the poor constraints on the rates of such large events at the Sun”, it 
would be wise to consider the possibility. A Carrington-level event 
would be disastrous for electromagnetic technology”®, causing wide- 
spread damage to satellites and transformers linking the power grid. 
No assessment has been made of the technological effects of an event 
20 times stronger. 

Solar flare rates of occurrence follow a power-law in energy em- 
pirically based only for energies smaller than that of the Carrington 
Event. The probability of the implied event can be studied using the 
statistics of rare events'’. Using the information that there was just 
one such event within ~1,250yr, the best estimate of the rate is 
8X10 “yr ’, with 2c (95.4%) confidence intervals from vanishingly 
small to 3.2 10 *yr '. Therefore the estimate of the probability of 
such an event within the next decade is 0.8%, which may be viewed as 
acceptably small unless the substantial technological consequences 
are considered. 

Based on rate/energy scalings we have examined previously’, a 
short y-ray burst could cause such effects from within ~1 kpc, but 
with an a priori probability of the order of 10° * over 1,250 yr. 

Atmospheric ionization depletes ozone, increasing the solar UVB 
that reaches the ground*’. We compute the ozone depletion’* with 
corrected fluence and the results are shown in Fig. 1. This is not a 
mass extinction level event. The results are consistent with moderate 
biological effects: reduction of primary photosynthesis in the 
oceans and increased risk of erythaema and skin cancer, but no major 
mass-extinction level effects as implied earlier’. A newly published 
study’* confirms our past computations of ozone depletion from 
a Carrington-level event, and suggests significant climate cooling 
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Figure 1 | Percentage change in globally averaged O3 column density. The 
change is between simulation runs with and without ionization input. Dashed 
line, with a short y-ray burst; solid line, with a solar proton event. 


might be a side effect, which would be enhanced for the event we 
describe here. 

It is worth noting that the *C data could have been initiated by a 
series of solar proton events, each contributing a somewhat smaller 
amount. The '“C production would add nearly linearly. 

After these estimates were made, we noted a new study” of upper 
limits on energetic events at the Sun. An event of the energy resulting 
from our scaling lies just at their upper limits for an event that might 
appear every thousand years or so. Therefore a solar proton event 
appears to bea possible cause, which demands further exploration of a 
potential massive threat to modern civilization. 
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